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The  problem  of  testing  for  difference  in  survival  times  using  randomly  right  cen- 
sored bivariate  data  has  been  considered  by  several  researchers.  Among  them  are 
Woolson  and  Lachenbruch  who  proposed  a class  of  statistics  for  testing  the  null  hy- 
pothesis {Ho)  of  bivariate  symmetry  in  Biometrika,  1980.  Popovich  and  Rao  proposed 
an  alternative  to  the  Woolson- Lachenbruch  class  of  statistics  in  Communications  in 
Statistics  14(9),  1985.  The  statistics  in  this  class  provided  a means  of  performing 
conditionally  distribution-free  exact  tests  of  Hq-  Dabrowska  used  a counting  pro- 
cess representation  to  derive  the  asymptotic  distributions  of  members  of  Woolson- 
Lachenbruch  class  of  statistics  under  fixed  and  contiguous  alternatives  in  the  Journal 
of  the  American  Statistical  Association  85,  1990. 

In  this  dissertation,  counting  process  representations  are  used  to  study  the  prop- 
erties of  Popovich-Rao  class  of  statistics.  Efficacies  of  the  members  of  this  class  are 
derived  using  a contiguous  sequence  of  parametric  alternatives.  The  form  of  the 
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statistic  with  maximum  efficacy,  to  be  called  optimal  statistic,  is  derived  for  both 

Popovich-Rao  and  Woolson-Lachenbruch  classes. 

Efficacies  of  the  optimal  statistics  are  compared  with  the  other  members  using  a 
log-linear  model.  Asymptotic  Relative  Efficiency  (ARE)  of  the  optimal  Popovich-Rao 
statistic  with  respect  to  optimal  statistics  in  other  classes  improves  as  the  correlation 
between  the  survival  times  increases.  Since  the  form  of  optimal  statistics  involve 
weights  which  depend  on  the  unknown  joint  distribution  of  survival  times  in  a com- 
plicated manner,  a method  is  proposed  for  estimating  the  weights  from  observed 

data. 

The  performance  of  optimal  statistics  is  evaluated  in  terms  of  powers  using  a sim- 
ulation study  with  several  survival  distributions,  censoring  patterns  and  sample  sizes. 
Simulation  studies  indicate  that  the  power  of  the  optimal  statistic  in  each  class  is 
higher  than  that  for  other  members  in  the  same  class.  The  results  of  power  compar- 
isons are  consistent  with  those  obtained  from  ARE  comparisons.  Furthermore,  the 
powers  indicate  that  the  optimal  weights  are  robust  with  respect  to  the  specifications 

of  underlying  parameters. 
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CHAPTER  1 
INTRODUCTION 


1.1  Censored  Life-Time  Data 

In  many  experiments,  one  is  interested  in  comparing  two  drugs  to  see  if  there  is  a 
difference  in  the  lengths  of  time  the  drugs  (treatments)  are  effective,  or  if  there  is  a 
difference  in  the  lengths  of  time  the  drugs  are  taking  to  cure  a certain  disease.  The 
time  period  during  which  a treatment  is  effective  is  called  the  survival  time  or  the 
lifetime  of  the  treatment.  If,  for  a particular  subject,  the  treatment  is  still  effective  at 
the  time  of  termination  of  the  experiment,  then  the  corresponding  survival  time,  i.e. 
the  time  period  in  which  a treatment  is  effective,  is  longer  than  the  observation  time 
for  that  subject.  In  that  case  the  true  survival  time  is  unknown  and  the  survival  time 
is  said  to  be  right  censored  at  the  length  of  time  the  subject  was  under  observation. 
Sometimes  at  the  time  of  the  first  observation  of  a treated  subject,  the  treatment 
is  already  ineffective  so  that  the  corresponding  survival  time  is  not  observable,  but 
known  to  be  less  than  the  observation  time  of  that  subject.  In  that  case  the  survival 
time  is  said  to  be  left  censored  at  the  observation  time. 

It  should  be  noted  that  censored  data  can  arise  in  cases  where  responses  are  not 
measured  on  a time-scale.  Miller  (1981)  gives  an  example  from  Leavitt  and  Olshen 
(1974)  where  the  measured  response  is  the  amount  of  money  paid  on  an  insurance 
claim  on  a certain  date.  For  this  example,  an  uncensored  response  would  be  the 
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amount  paid  on  the  total  claim,  while  a right  censored  response  would  be  the  amount 
paid  to  date  if  the  total  claim  is  unknown. 

Censoring  can  arise  in  different  situations  due  to  experimental  restrictions  on  the 
observation  time.  It  can  be  broadly  classified  into  three  categories.  Type  I censoring 
occurs  when  the  period  of  observation  for  each  subject  is  preset  to  be  a certain  length 
of  time  T,  called  fixed  censoring  time.  In  such  a case  survival  times  longer  than  T 
would  be  right  censored  while  those  that  are  less  than  T would  be  observed  and  are 
failure  times.  Type  II  censoring  occurs  when  the  period  of  observation  will  terminate 
as  soon  as  the  rth  failure  is  observed  where  r is  a predetermined  integer  less  than  n, 
the  number  of  subjects  involved  in  the  study.  Then  the  n — r survival  times  which  are 
unobserved  are  right  censored  at  the  rth  failure  time.  The  third  form  of  censoring 
is  called  random  censoring.  Random  censoring  occurs  when  subjects  have  their  own 
observation  times  which  need  not  be  the  same  for  all  subjects.  In  most  clinical  trials, 
subjects  enter  the  study  at  different  times  and  each  subject’s  observation  time  is  the 
elapsed  time  between  entry  and  the  end  of  the  study.  Clearly,  if  a subject’s  survival 
time  is  greater  than  the  subject’s  observation  time,  then  a right  censored  survival 
time  results;  otherwise,  a failure  time  is  observed  for  the  subject. 


1.2  Overview  of  the  Manuscript 


This  dissertation  is  concerned  with  the  problem  of  testing  the  equality  of  two 
survival  distributions  using  matched  pairs  of  randomly  right  censored  survival  times. 
Survival  times  may  occur  in  pairs  either  naturally  or  by  experimental  design  (Kalb- 
fleisch  & Prentice,  1980).  Examples  of  natural  matched  pairs  are  twin  studies  or 
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the  studies  in  which  two  survival  times  are  recorded  on  the  same  individual  or  piece 
of  equipment.  The  other  case  of  paired  data  arises  in  a matched  pair  study  where 
individuals  sharing  certain  characteristics  are  assigned  to  a pair.  For  comparing  the 
effect  of  two  drugs,  patients  with  similar  characteristics  are  paired,  then  each  receives 
a drug  at  random  and  are  kept  under  observation. 

In  what  follows,  it  is  assumed  that  members  from  a given  pair  have  equal  censoring 
times.  When  survival  times  are  recorded  on  the  same  individual,  censoring  time  is 
the  same  for  both  observations  in  the  pair.  But,  when  pairs  are  matched  through 
experimental  designs,  censoring  time  can  be  different  for  the  two  members  in  the 
pair.  In  that  case,  the  minimum  of  the  two  censoring  times  is  taken  as  the  common 
censoring  time  for  the  pair.  It  is  also  assumed  that  censoring  is  independent  of  the 
treatments  under  study  and  the  censoring,  if  it  occurs,  is  right  censoring. 

Chapter  2 presents  some  key  notation  along  with  a brief  literature  review  and  a 
description  of  the  general  problem.  Chapter  3 is  devoted  to  developing  efficacies  of 
statistics  belonging  to  specific  classes  considered  by  Woolson  & Lachenbruch  (1980) 
and  Popovich  & Rao  (1985).  The  notion  of  an  optimal  statistic  for  each  class  is 
defined.  A model  is  proposed  in  Chapter  4 for  practical  implementation  of  tests 
based  on  optimal  statistics.  The  results  of  a simulation  study  and  an  illustration 
with  real  data  are  presented  in  Chapter  5. 

The  referencing  system  in  this  manuscript  will  follow  the  convention  of  numbering 
equations,  theorems  and  lemmas  within  section.  However,  Tables  and  Figures  will  be 
numbered  within  chapter.  Theorems  and  lemmas  are  counted  within  a section  and 
are  numbered  independently  of  one  another.  For  example,  the  first  theorem  in  the 
second  section  of  Chapter  2 is  Theorem  2.2.1,  and  the  first  lemma  is  Lemma  2.2.1. 


CHAPTER  2 

FORMULATION  OF  THE  PROBLEM 

2.1  Introduction 


Some  notation  needed  to  develop  the  topic  of  this  dissertation  is  described  in 
Section  2.2  while  Section  2.3  describes  the  general  problem  with  a brief  historical 
survey  of  the  relevant  literature.  Section  2.4  contains  a description  of  the  specific 
objective  of  this  dissertation. 


2.2  Notation 


Let  (ATi,  X2)  be  a bivariate  random  vector  with  a continuous  density  function 
^{xi,  X2).  Let  C be  an  independent  continuous  random  variable  with  cumulative 
distribution  function  (c.d.f.)  G and  survival  function  G = 1 — G.  We  shall  refer 
to  Xi  and  X2  as  the  survival  times  and  to  C as  the  censoring  time.  The  observed 
survival  times  are 

Ti  = min(A:i,  C) 

and 

¥2  = min(A:2,  C). 

In  addition  to  Ti  and  Y2,  we  also  observe, 


4 


and 


0,  otherwise. 


1,  if  X2  < C 
0,  otherwise. 


The  indicator  random  variables  61  and  62  specify  whether  an  observed  survival  time 
is  a censored  or  a true  survival  time. 

Let  Z = Y2  — Yi  he  the  observed  dilference  between  the  survival  times  in  a pair 
and  e = sign  (Z),  where 


f 1,  iiZ  >0 


sign  (Z)  = < 


0,  \{  Z = 0 


l-l,  ifZ<0. 


The  censoring  mechanism  is  such  that  e = 0 and  Z = 0 whenever  ^1  = ^2  = 0.  Also, 
€ = 1 whenever  = 1 and  62  = 0,  and  e = — 1 whenever  (Si  = 0 and  ^2  = T 

Let  (ATii,  A'2t)  and  Ci,i  = 1, 2,  3 . . . , n be  independently  and  identically  distributed 
(i.i.d.)  as  (Ai,  A2)  and  C respectively.  Also,  let 

n,  = min(Afci,  Ci), 

Zi  - Yu  - Y2i  and  = sign  (Z,  ), 
for  fc  = 1, 2,  z = 1, . . . , n. 

An  observed  (Ti^,  y2t)  can  be  classified  into  one  of  five  classes  based  on  the 
(the  sign  of  Zi)  and  (^li,  ^2t)  (the  censoring  pattern  of  the  (Tii,  i2t))-  The  classes 
are  defined  by  the  sets  of  indices  : 
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B2  = {i:  ei  = -1,  6u  = = 1}, 

B3  = {i  : 6u  = 1 and  ^2t  = 0}, 

B4  = {t  : = 0 and  S2i  = 1}, 

Bs  = = ^2i  = 0}.  (2.2.1) 


If  Yu  and  Y2i  are  both  uncensored,  then  for  i E Bi  U B2,  Zi  = Xu  — is  the 
difference  between  the  actual  survival  times  for  the  ith  pair.  In  that  case,  Z{  will  be 
called  an  uncensored  difference  and  the  corresponding  pair  of  observed  survival  times 
will  be  called  an  uncensored  pair. 

If  exactly  one  of  the  survival  times  in  (Fii,  Y2i)  is  censored,  then  i G B3  U B4. 
The  corresponding  Z,  will  be  referred  to  as  a censored  difference  and  the  pair  will  be 
called  a singly  censored  pair.  In  particular,  it  should  be  noted  that  if  f G B3,  then 
Zi  < X2i  — Xu  i-e.  the  observed  difference  is  less  than  the  true  difference.  Similarly, 
the  observed  difference  is  greater  than  the  true  difference  for  i G B4.  Finally,  if 
i G 5s,  then  Zi  = 0 and  we  shall  refer  to  (Yii,  Y2i)  as  a doubly  censored  pair. 

Let 


i\T„  = Total  number  of  uncensored  pairs 


where  Nu+  and  Nu-  are  the  total  numbers  of  uncensored  positive  and  negative  dif- 
ferences, respectively. 


Nc  = Total  number  of  singly  censored  pairs 


n 


n 


= ^ (1  ~ ^2t)  + ^ (1  ~ ^1:)  ^2t 

i=l  t=l 

= Nc^  + Nc->, 
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where  Nc^  and  Nc-  are  the  total  numbers  of  singly  censored  positive  and  negative 
differences  respectively,  and 

Nd  = Total  number  of  doubly  censored  pairs 

i=l 

The  various  statistics  to  be  considered  in  this  manuscript  will  be  represented  using 
the  following  four  counting  processes  : 

n 

Nj{t)  = ; = 1,2,3,4, 

1=1 

where 


— ^[  \ ^ ^ Bj],  i = 1, 2, . . . n,  j = 1,2, 3, 4, 


are  the  processes  that  count  the  occurrences  of  uncensored  and  singly  censored  ab- 
solute differences. 

Following  Dabrowska  (1990),  we  define  the  subsurvival  function  Fj  as 
Fj{t)  = P{\Zi\>t,  t e Bj),  ; = 1,2,3,4, 


and 

nt)  = (2-2.2) 

J=1 

where  P{A)  stands  for  the  probability  of  the  occurrence  of  event  A.  Note  that 
F_,(0)  = P{i  G Bj)  and 

X:F,(0)  = 1-po<1, 

i=i 

where  po  is  the  probability  of  a doubly  censored  observation  given  by 


Po  — P{i  ^ 

- nn:  ip{x,  y)  dx  di/|  dG{u). 


(2.2.3) 
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The  improper  densities  corresponding  to  the  four  subsurvival  functions  can  be 


expressed  as 


and 


/i(0  = 


= j G(u)  ip(u  — t,  u)  du, 


f2{t)  = 


Mt)  = 


J G{u)  V'(u,  u — t)  du, 


= /{/  Hu-t,  y)dy}dG{u), 


= j { J u-t)  dj/|  dG{u). 


(2.2.4) 


Let  Fu(')  denote  the  conditional  survival  distribution  of  \Zi\  given  the  difference 
is  uncensored  i.e.,  given  both  Yu  and  1^21  are  uncensored.  If  p„,  the  probability  of 
observing  an  uncensored  pair  is  nonzero  then 


= P(  I Z.  I > t,  f G Pi  U P2) 
(fi(t)  + F,(t)) 

P(i  € Bi  U B2) 


(fi(()  + f2(0) 
(F,(0)  + F,(0)) 


Vu 

where  F*{-)  is  the  subsurvival  function  of  jZij  for  i G Pi  U P2- 


(2.2.5) 
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An  unbiased  estimate  of  Fu{t)  is  given  by  the  conditional  empirical  survival  dis- 


tribution, 


K{f)  = 


Kit) 


p» 


A 

where  F*{*)  and  are  the  sample  analogs  of  F*{-)  and  respectively. 

The  notation  described  in  this  section  will  be  used  frequently  in  the  remainder  of 
this  dissertation.  Other  notation  necessary  in  the  manuscript  will  be  introduced  as 
needed. 


2.3  Literature  Review 


The  problem  of  testing  for  the  difference  in  survival  times  using  randomly  right 
censored  bivariate  data  has  been  considered  by  several  researchers.  Among  them  are 
Woolson  and  Lachenbruch  (1980)  who  proposed  a class  of  statistics  (to  be  called  the 
WL-class)  for  testing  the  null  hypothesis, 


Ho  : X2)  is  symmetric  about  0, 


against  the  alternative. 


Hi  : V^(xi,  X2)  is  not  symmetric  about  0. 


Let  <(i)  < t(2)  . . . < t{Nu)  denote  the  ordered,  absolute,  uncensored  differences, 
(i.e.,  the  ordered  \Zi\  in  B\  U B2)  and  e(j)  be  the  corresponding  to  Furthermore, 
let  and  denote,  respectively,  the  ordered  values  of  the  absolute  differences 
in  B3  U B4  which  are  contained  in  the  interval  and  the  corresponding 

52-^7l(t(|j;)),  J 1,  . . . , 7Z{^  ^(0)  ~hOO. 
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In  the  notation  of  Woolson  and  Lachenbruch  (1980),  a statistic  in  the  WL-class 


can  be  expressed  as 

T = + (2.3.1) 

1=1  t=0  j=i 

where  Si  and  5^,  i = are  scores  that  depend  only  upon  the  form  of 

X2),  the  joint  density  of  (Xi,  X2). 

For  small  sample  sizes,  Woolson  and  Lachenbruch  (1980)  proposed  a conditional 
test  of  Ho  using  the  observed  scores  Si  and  z = 1,  . . . iV^.  The  conditional  test  is 
to  be  based  on  the  permutation  distribution  of  T generated  by  the  2^  equally  likely 
assignments  of  signs  to  the  scores.  For  large  sample  sizes,  a test  of  Hq  can  be  based 
on  the  asymptotic  normality  of  T,  a property  established  by  Dabrowska  (1990). 

Woolson  and  Lachenbruch  (1980)  developed  the  form  of  Si  and  Si  under  the 
assumption  that  (Xi,  ^^2)  satisfies  the  log-linear  model  : 

logXi  — ^ + 771  + I/, 

logX2  = 712  + ^1  (2.3.2) 


where  9 is  an  unknown  parameter,  771, 772  and  u are  mutually  independently  distributed 
random  variables  and  D*  = logXi  — logX2  has  a specified  density  function  (f>  that  is 
symmetric  about  0. 

Censored  data  versions  of  Sign  and  Wilcoxon  signed  rank  statistics  correspond 
to  using  double  exponential  and  logistic  density  for  the  density  of  Z)*,  respectively. 
These  are  the  two  cases  explicitly  considered  by  Woolson  and  Lachenbruch.  In  what 
follows,  we  will  refer  to  the  censored  data  versions  of  the  Wilcoxon  signed  rank 
statistic  and  the  Sign  statistic  as  the  WL  Wilcoxon  signed  rank  statistic  and  the  WL 
Sign  statistic  respectively. 

Although  the  WL-class  of  statistics  provide  a reasonable  method  for  testing  iZo, 
due  to  the  complicated  functional  form  of  the  members  of  this  class,  it  is  difficult  to 
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investigate  their  theoretical  properties.  Also,  for  small  sample  test,  determination  of 
the  critical  values  needed  for  the  permutation  test  can  be  tedious. 

Popovich  and  Rao  (1985)  proposed  an  alternative  to  the  WL-class  of  statistics. 
The  statistics  in  this  class  (to  be  called  the  PR-class)  provided  a means  of  performing 
conditionally  distribution-free  tests  of  Hq  vs  H\.  Just  as  the  statistics  in  the  WL- 
class,  the  statistics  in  the  PR-class  are  based  on  ranks  of  absolute  differences  |Zt|, 
the  signs  and  the  censoring  indicators  ((^ii,  ^2i)- 

The  statistics  in  the  PR-class  can  be  expressed  in  the  form  : 

Tn{N^,N,)  = {1-Ln)'^  T:  + LI  T: 

where  T*  and  T*  are  standardized  (under  Hq)  versions  of  statistics  and  Tc  such 
that,  Tu  is  a function  of  (|Z»|,  6^)  for  z G 5i  U B2  and  Tc  is  a function  of  for 
i ^ B3L)  B4.  The  coefficient  Ln  is  a function  of  N^,  and  Nc  such  that, 

(i)  0 < < 1, 


(ii)  Ln  = L + Op(l)  as  n — >•  00  for  some  constant  L. 

Any  statistic  for  testing  the  symmetry  of  the  conditional  distribution  of  Zi,  given 
i e BiU  B2,  is  reasonable  for  testing  Hq.  One  such  statistic  is  the  Wilcoxon  signed 
rank  statistic  (Randles  & Wolfe,  1979)  applied  to  the  given  i ^ B\  U 62-  The 
corresponding  statistic  has  the  form. 


= T,  nZi)Rt 

ieBiUB2 


(2.3.3) 


where  Rf  denotes  the  rank  of  | Zi  |,  in  the  set  {\Zi\  , z E U J52},  and  ^(Z,)  = 1 
or  0 depending  on  Z^  > 0 or  Zi  < 0,  respectively. 

Popovich  and  Rao  (1985)  selected  Wilcoxon  signed  rank  statistic  as  due  to  its 
high  efficiency  for  a wide  variety  of  distributions  of  the  Zi  and  availability  of  tables 
of  critical  values  for  a large  selection  of  sample  sizes. 
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In  their  search  for  a candidate  Tc,  they  showed  that,  conditional  on  = n„,  and 
Nc  = ric,  any  statistic  based  on  Nc+  and  Nc~  is  independent  of  any  statistic  based  on 
{\Zi\,  €i)  for  i €.  BiU  B2.  In  particular,  the  Sign  statistic  (Randles  & Wolfe,  1979) 
based  on 

T,  = - N,.  (2.3.4) 

is  conditionally  independent  of  the  statistic  based  on  in  (2.3.3).  Since  Tc  can  also 
be  used  for  testing  Hq,  the  linear  combination 

r„(iv„,  i\i,)  = (i-L„)5  + r;,  (2.3.5) 

is  a member  of  the  PR-class.  The  statistics  in  this  class  are  conditionally  distribution- 
free  under  Hq.  Since  T„  and  Tc  are  conditionally  independent,  exact  critical  values 
for  T„(iVu,  Nc)  can  be  easily  constructed  from  the  critical  values  for  Wilcoxon  signed 
rank  and  Sign  statistic  which  are  widely  available. 

The  exact  forms  of  T*  and  T*  at  (2.3.5)  can  be  determined  by  noting  that  under 
Ho,  the  conditional  means  and  variances  of  r„  and  Tc  are 


and 


/iu  = Tu  I iV„  = n„,  Nc  =ric] 

1 , 

— (^u  d”  ^)» 

Var[  I N„  = Nc  = ric] 

1 


= 


24 


U-tx  (u-y  -f-  1)  (2tx,x  ”b 


fic  = E[Tc\  Nu  = n„,  Nc  ■ ric 


= 0, 


<7^  = Var[  Tc\  A^u  = Nc  = 


He- 


(2.3.6) 
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Therefore, 

t:  = [i\r„(iV„  + l)(2iV„  + l)/24]-5  [r„-iV„(iV„  + l)/4], 

and 

r;  = AT<r^  Te. 

Assuming  some  regularity  conditions,  Popovich  and  Rao  (1983)  also  showed  that 
the  asymptotic  null  distribution  of  Tn{Nu,  Nc)  at  (2.3.5)  is  a standard  normal  distri- 
bution. 

A simulation  study  was  done  by  Popovich  and  Rao  (1983)  to  compare  the  powers 
of  tests  based  on  four  statistics  in  the  PR-class  to  the  power  of  WL  Wilcoxon  signed 
rank  statistic.  The  statistics  in  the  PR-class  were  determined  by  the  following  four 
forms  of  Ln- 

When  nothing  is  known  about  the  underlying  distribution,  a reasonable  choice 
for  Ln  is  0.5,  leading  to  the  first  statistic 

T-EQ  = [(0.5)°-®  T*  + (0.5)°-®  T*].  (2.3.7) 

Other  choices  of  Ln  considered  by  Popovich  and  Rao  are,  proportional  to  Nc, 
resulting  in  the  statistic 

T - SQR  = {Nn  -b  Nc)-°-^  [{NX^  K + {Ncf-^  T,*],  (2.3.8) 

Ln  proportional  to  N^,  yielding  the  third  statistic 

T-55  = (Nl  + [{Nn)  T:  + {Nc)  T:],  (2.3.9) 

and  Ln  proportional  to  in  (2.3.6),  leading  to  the  fourth  statistic 


T - STD  ^ {al  + [TJ  cr„  + T;  <t,]. 


(2.3.10) 
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The  simulation  study  involved  four  distributional  forms  for  Zy,  and  generated 
data  in  such  a way  that  uncensored  and  singly  censored  pairs  were  approximately 
75%  and  20%  of  the  total  sample  size.  For  each  case,  500  random  samples  of  size  50 
were  generated. 

The  authors  concluded  that  none  of  the  statistics  emerged  as  the  best  statistic  for 
all  distributional  forms.  For  light-tailed  distributions  they  recommended  the  statistics 
based  on  T-EQ  for  small  to  moderate  sample  sizes  and  WL  Wilcoxon  signed  rank 
statistic  if  sample  size  is  large. 

Although  not  much  difference  in  large  sample  powers  was  observed,  Popovich  and 
Rao  (1985)  mentioned  that  the  selection  of  test  procedure  for  testing  should  not 
only  take  into  account  the  power  of  the  test  procedure,  but  also  should  be  based  on  the 
availability  of  critical  values  and  the  level  of  difficulty  involved  in  the  implementation 
of  the  test  procedures.  As  mentioned  earlier,  critical  values  of  statistics  A^c) 

when  sample  size  is  small  are  easy  to  generate  but  that  is  not  the  case  for  WL 
Wilcoxon  signed  rank  statistic.  For  large  sample  sizes,  critical  values  for  Tn{Nuy  Nc) 
are  obtained  by  using  the  standard  normal  distribution. 

Woolson  and  Lachenbruch  (1980)  claimed  that  the  conditional  null  distribution  of 
the  statistics  in  the  WL-class  have  asymptotic  normal  distribution  as  min  (W^,  Nc)  — ^ 
oo.  Until  Dabrowska  (1990)  established  the  asymptotic  normality  of  T in  (2.3.1),  it 
was  not  clear  if  its  asymptotic  normality  holds  unconditionally  as  the  overall  sample 
size,  n oo. 

For  selecting  the  coefficients  Ln,  it  was  very  natural  to  consider  the  possibility  of 
selection  based  on  the  conditional  Pitman  Asymptotic  Relative  Efficiency  (PARE) 
as  min  (n^^,  n^)  — ^ oo.  However  Popovich  and  Rao  (1985)  noted  that  the  expressions 
for  PARE  were  complicated  and  they  could  not  simplify  them  successfully. 
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Dabrowska  (1990)  represented  the  WL-class  of  statistics  using  score  functions  and 
counting  processes  in  the  form, 


T = 


- S{t))  d{N^{t)  - N2{t)) 


+ J Jc(l- S{t))d{N3{t)- (2.3.11) 

A 

where  S{t)  denotes  the  Kaplan-Meier  (1958)  estimator  of  P(|.^|  > t)  based  on  the 
Zi\,  i € Uy_jSj.  Also,  Ju{‘)  and  Jc{‘)  are  score  functions  defined  on  [0, 1]  such  that. 


(2.3.12) 


Dabrowska’s  choice  of  score  functions  was  motivated  by  the  fact  that  the  censored 
data  version  of  the  signed  rank  statistics  considered  by  Woolson  and  Lachenbruch 
(1980)  satisfied  (2.3.12).  For  testing  the  null  hypothesis  of  bivariate  symmetry  under 
log-linear  models,  (2.3.2),  the  score  functions  of  Woolson  and  Lachenbruch  (1980) 


are. 


J„(v)  = and  Mv)  = 2 


(f){w) 


(1  -wY 


where  tt;  = $(1  -f  |)  and  $ is  the  c.d.f.  corresponding  to  (f). 

As  mentioned  earlier,  censored  data  versions  of  Sign  and  Wilcoxon  signed  rank 
statistics  correspond  to  substituting  double  exponential  and  logistic  density  for  <f)  re- 
spectively and  these  are  the  two  cases  explicitly  considered  by  Woolson  and  Lachen- 
bruch. Dabrowska  (1990)  also  considered  these  two  special  cases  of  T.  The  corre- 


sponding scores  are. 


Ju{v)  = Jc{v)  = 1, 


for  T to  be  the  same  as  WL  Sign  statistic  and 

1 V 

Ju{v)  = V and  Jc{v)  =2  + 9’ 

for  T to  be  asymptotically  equivalent  to  WL  Wilcoxon  signed  rank  statistic. 
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Dabrowska  used  the  representation  at  (2.3.11)  to  investigate  the  properties  of  WL 
signed  rank  statistics.  She  derived  the  asymptotic  distributions  of  the  statistics  under 
Ho  and  as  well  as  under  a sequence  of  contiguous  alternatives  and  examined  their 
Pitman  asymptotic  relative  efficiencies  using  a log-linear  model. 


2.4  Objective 


The  general  objective  in  this  dissertation  is  to  investigate  the  properties  of  the  PR- 
class  and  WL-class  of  statistics  with  a view  to  develop  statistics  for  tests  of  Hq  that 
are  optimal  with  respect  to  appropriately  chosen  parametric  alternatives.  Following 
is  a list  of  special  tasks  undertaken  to  achieve  the  above  stated  goals. 

1.  Use  Dabrowska’s  (1990)  representation  of  rank  tests  to  derive  the  efficacies 
of  members  of  the  PR-class  of  statistics. 


2.  Determine  the  optimal  statistics  in  the  PR  and  WL-classes. 

3.  Compare  the  efficacies  of  the  optimal  statistics  with  the  others  using  a 
log-linear  model. 


4.  Develop  methods  for  practical  implementation  of  optimal  tests. 

5.  Compare  powers  of  optimal  tests  with  others  using  simulation. 


We  assume  throughout  the  remainder  of  this  manuscript  that  the  assumptions 


listed  below  are  valid. 
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Assumptions  : 

(AI).  (Xu,  A'2i),  i = are  i.i.d.  as  a continuous  bivariate  random 

variable  (Xi,  X2)  with  density  ^2)- 

(All).  Ci,  i = 1, . . . ,n  are  i.i.d.  as  a continuous  random  variable  C and  are 
independent  of  (Xij,  X2t),  i = 1, . . . , n. 

(AIII).  The  probability  of  double  censoring  is  strictly  less  than  1.  That  is, 
po  = P(A'i  >C,  X2>C)  < 1. 


CHAPTER  3 
EFFICACIES 

3.1  Introduction 


In  this  Chapter,  we  use  counting  process  representations  to  study  the  properties 
of  the  PR-class  of  statistics.  Asymptotic  distributions  of  these  statistics  under  the 
null  hypothesis  of  bivariate  symmetry  are  given  in  Section  3.2.  In  Section  3.3,  the 
efficacies  of  the  statistics  in  this  class  are  derived  using  a contiguous  sequence  of 
parametric  alternatives.  Formulation  of  optimal  statistic  is  done  in  Section  3.4.  In 
Section  3.5,  numerical  comparison  of  the  efficacies  of  a number  of  relevant  statistics 
is  done  using  a log-linear  model. 

3.2  Null  Distribution  of  the  Statistics  in  the  PR-Class 

A 

Let  J(-)  be  a score  generating  function  defined  on  [0, 1],  and  Fu{-)  be  the  condi- 
tional  subsurvival  function  of  \Zi\  defined  in  Section  2.2. 

Consider  the  statistic, 

T = f J(1  - K(t))  d(Ni(t)  - N2(t)) 

+ I Lud{Ns{t)-N^{t)),  (3.2.1) 

where  Liu  and  L\c  are  random  variables  satisfying  the  conditions. 
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(i)  Liu  = Lu  + Op(l),  for  some  constant  Lu  and  0 < Liu  < 

(ii)  Lie  = Lc  + Op(l),  for  some  constant  Lc  and  0 < Lie  < 1- 

The  form  of  T in  (3.2.1)  implies  that,  if  Liu  = 0,  then  the  uncensored  pairs  in  the 
data  have  no  contribution  to  the  statistic.  Similarly,  if  Lie  = 0,  then  the  censored 
pairs  in  the  data  have  no  contribution  to  T.  Indeed,  T can  be  represented  as  a linear 
combination  of  two  statistics: 

T = Liu  Tiu  + Lie  Tie',  (3.2.2) 


where 


Tiu  = j J{l-Fu{t))d{Ni{t)-N2{t)), 

and 

Tu  = j d(N,{t)  - 


(3.2.3) 


(3.2.4) 


Note  that  Tu  is  the  Sign  statistic  based  on  Zi  in  B3UB4.  Also,  corresponding  to  the 
score  function 


J(l-Fu)  = (1-K), 


(3.2.5) 


Tiu  is  the  Wilcoxon  signed  rank  statistic  based  on  Zi  in  B\  U 52,  because 

r,„  = / (1  - FJt))  d(N,{t)  - N,{t)) 

= nz,)Rt  - E 

i€BiUB2  ieBiUB2 


'^{Zi)Rt  - 


Tiu  (f  “b  Tiu) 


2 


4 
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= 2 ^ <a{Z,)Rt 

\t6BluB2 

= 2(T„  - fin), 


where  and  are  given  in  (2.3.3)  and  (2.3.6),  respectively. 

The  following  two  Lemmas  will  be  used  in  the  proof  of  the  asymptotic  normality 
of  r. 

Lemma  3.2.1  Suppose  the  score  generating  function  J is  continuous  and 

(i)  there  exists  M < 00  such  that  | J(u)|  < M for  all  0 < u < 1, 

(ii)  there  exists  Mq  < 00  such  that  |J^(u)l  < Mo  for  all  0 < u < 1. 


Let  T be  as  in  (3.2.1)  and  define 

T'  = j Liu  J{1  - Fu{t))d{Ni{t)  - N2{t))  + J Lu  d{No{t)  - N^{t)). 

Then 


^(r  - T')  = 0,(1). 

yTi 

Proof.  Notice  that,  if  = 0,  then 


{T  — T')  = 0 with  probability  1. 


For  pu  > 0,  we  have 


1 


y/n 


Liu  J{l-Fu{t))d{Ni{t)-N2{t)) 


{T  - T') 

— f/ 

y/n  U 

- I Liu  J(1  - Fu{t))d{Ni{t)  - N2{t)) 

= I Liu  [J(l  - Fu{t))  - J{1  - F„(t))l  d{Ni{t)  - 

yTi  J '• 


N2{t)).  (3.2.6) 
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Taylor  series  expansion  of  J(*)  around  (1  — -P’u(O) 


J(l-F„(f))  = J{1  - F^{t))  + {Fu{t)  - K{t))J'{(m 


(3.2.7) 


where 


min  ^1  - Fu(<),  1 - Fu{tfj  < ({t)  < max  1 - ■P’u(O)  • 


Using  (3.2.7)  and  boundedness  of  J in  (3.2.6)  we  have 


1 


y/n 


{T  - T') 


1 


< 


y/n 

1 


Tiu  (Fu(t)  - F„(f))  J'(e(0)  - ^2(0) 


Mo  sup|F„(f)  - Fu(f)|  suplLiul  I J d{Ni{t)  - N2{t)) 


Now,  using  the  fact  that,  for  p„  > 0, 


= 

Pu  Pu 


(3.2.8) 


and 


sup 


F:{t)  - P:(t)  = op(i), 


we 


have 


^ A 

SUp\Fu{t)  — Fu{t)\  = sup 


< 


F:{t)  F:{t) 


Pu  Pu 

F:{t)  ^ 

\Pu  Pu 

1 


sup 

Pu  Pu 
Op(l)  4-  Op(l)op(l) 


+ UKit)  - P:w) 

Pu 


+ — sup 
Pu 


Kit)  - F:(t) 


= Op(l). 


(3.2.9) 
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Again, 


1 


I d{Nr{t)  - N2{t))  < J d{N,{t)  + N2{t)) 


y/n 

= \/npu 

= C>p(l), 


(3.2.10) 


and 


sup  \Liu\  < 1. 


(3.2.11) 


Therefore,  using  (3.2.9),  (3.2.10)  and  (3.2.11)  in  (3.2.8),  we  get 


1 


\/n 


{T-T')  = Op(l). 


Lemma  3.2.2  Suppose  the  conditions  of  Lemma  3.2.1  are  satisfied.  Let 

= J J{1  - F^{t))d{N,{t)  - N2(t))  + j L,  d{Nz{t)  - N^t)). 


Then  for  T'  defined  in  Lemma  3.2.1, 


1 


■y/n 


{T  - T.)  = op(l). 


Proof.  We  have 


1 


y/n 


(T'-Ti) 


L y(I„  - Li„)  J(l  - F^{t))d{N^{t)  - N2{t)) 

J (L,  - Lie)  d{N2{t)  - N,{t)) 

L f d{Nr{t)  - N2(t)) 

/n  J 


+ 

< Op(l)  sup|J(l  - Fu(f))| 

1 


+ Op(l) 


y/n 


y/n 

d{N2{t)  - N4t)) 
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Now,  using  the  fact  that 


I d{N^{t)  - N,{t)) 


< -^  [ d{N,{t)  + N,{t)) 
y/n  J 

N, 

~ 

= Op(i), 


we  get 


y/n 

= Op(l)Op(l)  + Op(l)Op(l) 
= Op(l). 


The  following  theorem  gives  the  asymptotic  null  distribution  of  T in  (3.2.1). 

Theorem  3.2.1  Assume  AI-AIII  in  Section  2.4  are  satisfied.  Suppose  that  the  condi- 
tions on  the  score  generating  function  J given  in  Lemma  3.2.1  are  valid.  Then  under 
the  hypothesis,  Ho,  that  (ATi,  X2)  has  a distribution  symmetric  about  0,  T / y/n 
converges  weakly  to  a normal  distribution  with  mean 


and  variance 


CTj  = 2 


J Ll  J2(l  - F,{t))h{t)dt  + j Llh{t)dt 


A consistent  estimator  of  cr^  is  given  by 


1 


A-2  — 

(J  rp  

n 


"2  4 

'ZLl  j ^“(1  - h(t))  dNi{t)  + 

.1=1  1=3 


Proof.  In  order  to  prove  the  asymptotic  normality  of  T,  note  that 


T = Ti  + {T  -T')  + {T'  -Ti), 
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where, 

Ti  = / Lu  J(1  - F^{t))d{Nr{t)  - N2{t))  + j diN^it)  - 

and 

T'  = J Au  J(1  - F^{t))d{Ni{t)  - N2{t))  + J lie  - N^{t)). 

From  Lemma  3.1.1  and  Lemma  3.1.2  we  have 

i(r-T')  = op(i), 

\Jn 

and 

^(r'-r,)  = o,(i). 

yjn 

Therefore,  a proof  of  the  theorem  can  be  given  by  showing  that  T\  / \/n  is  asymptot- 
ically normal  with  mean  0 and  variance  cr|..  Now,  T\  can  be  expressed  as  : 

Ti  = J(l-F„(|Zil))  + L,(l-5i)] 

i=l 

= EA.  (3.2.12) 

1=1 

where 

I3i  = ei[6iL^J{l-F^{\Zi\))  + L,(l-6i)], 

and 


Note  that,  if  = 1,  then  both  observations  in  the  ith  pair  are  uncensored.  Under  Ho, 
and  conditional  on  6i  and  \Zi\,  the  probability  of  {Zi  > 0)  is  same  as  the  probability 
of  {Zi  < 0).  Therefore, 


E[ei  |Zi|  ] — 0, 


(3.2.13) 
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so 


that 


= E[E{f3i\6i,\Z^\)] 

= E[E{e,\8i,\Z,\)  [6,L^J{1- F^{\Zi\))  + Lc{l-6,)\ 

= 0. 


Also, 


var{(3i)  = E{!3l) 


= E [4  («?  Ll  J\l  - F.(|Z.D)  + Ll  (1  - S,f 
+2Si(\  - ii)  L„  L,  J(1 -F.(|2i|)))| 

(6]Ll  J"(l-F„(14l))  + i2(l-A)' 


= EE 


+2  Lc  <5.(1  - <5i)J(l  - F«(lZi|)))  6i 
= P[6i  = l]E[Ll  J\1  - F„(lZ.|))|6i  = 1]  + P[6i  = 0]E[Ll  \6i  = 0] 

= 2Fi(0)  J Ll  J\1  - F^{t))dF,^s,=^{t)  + 2F3(0)  j L]  c?Fe,5,=o(0 

= 2F,(0)  J Ll  J\1  - Fu{t))^^dt  + 2F3(0)  j L^  - 


= 2 


(0)”  ’ 2Fs{0) 

j Ll  J\1  - F^{t))Mt)dt  + j Ll  h{t)dt 


(J  'J' . 


Therefore,  since  the  j3i  are  independent. 


var 


II 

y/n 


= a'' 


Since  Ti  is  a sum  of  i.i.d  random  variables  with  mean  zero  and  variance  (a^  < 
oo),  the  asymptotic  normality  of  Til  y/n  follows  from  Lindeberg-Levy  central  limit 


theorem. 
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Next,  we  proceed  to  prove  that  is  consistent  for  Observe  that, 


''2  ~2  i/^2  "2^ 

O' 'p  — O' fji  "f*  ^ T ) y 


where 


-2 
O^  rp 


= - [ E / ‘^'(1  - dNi{t)  + E • (3-2.14) 

^ i=3 


Consistency  of  follows  if  we  show  that  is  consistent  for  o'p  and 


o\  — b\  — Op(l). 


From  (3.2.14),  we  have 


4 ^ -R(|Zil))  + i“(i  - <01 


»=1 

1 ” 


= - E"»’ 

n 


i-i 


where 


ai  = 6i  Ll  J\l  - F{\Zi\))  + Ll{l-6i)  t = l,...,n. 


are  i.i.d.  random  variables  with  mean. 


E{ai)  = E E{ai  6i) 


= EE 


SiLl  J\\  - F^{\Zi\))  + L\  {I  - 8i)  8i 


= P[8i  = 1]  E[Ll  J\l  - F^Zi\))\8i  = 1]  + P[8i  = 0]  E[Ll  \8,  = 0] 

= 2Fi(0)  J Ll  J\\  - + 2F3(0)  J L]  dPt^sM^) 

= 2F,(0)  J Ll  Al  - FM)  + mO) 


= 2 


J J^(l  — Fu{t))fi{t)dt  + j Ll  f3{t)dt 


O'  p » 
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Therefore, 


E{a^)  = 2 


Ll  J2(l  - F^it))h{t)dt  + / Ll  Mt)dt 


^2 


(3.2.15) 


Since  is  a sum  of  i.i.d.  random  variables  with  mean  cr^  {cj  < oo),  consistency  of 
dy  follows  from  the  law  of  large  numbers. 

Now,  we  have  to  prove  that 

~ 2 


dj  — (7^  = Op(l). 


We  can  write. 


1 


cr-p  — CTj’  — 2^ 


I 


U 


J^(l  - F„(t))  - J^(l  - K{t)}  dNi{t). 


Applying  Taylor  series  expansion,  we  obtain, 

- F„(i))  = - F.(<))  + (F.(«)  - 2 J(«())7(f  (()) 


where 


min  (l  - F„(t),  1 - < ({t)  < max  (l  - F^{t),  1 - F„(t))  . 


Therefore, 


-2  - 2 
<j  j — cr  j 


= |-E  I i‘lwt)-Fu(t))2j{mAm)dNi{t) 

1 I ^ /• 

< -MMo  sup\F^{t)  - K{t)\ 

” i=i 


Since, 


if;/  dN;(,t)  = 

n ^ j 
1=1 


N^  + Nc 


n 


= Op(l), 


using  the  same  argument  as  in  the  proof  of  Lemma  3.2.1,  we  have 


This  completes  the  proof  of  the  Theorem  3.2.1. 
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3.3  EfScacies  of  the  Statistics  in  the  PR-Class 


In  order  to  derive  the  efficacies  of  the  tests  based  on  statistics  of  the  form  T in 
(3.2.1),  we  consider  the  sequence  of  contiguous  alternatives  with  densities  of  the  form, 


V’nlxi,  ®2)  = ^2) 


7n(a:i,  X2) 


(3.3.1) 


where  {7n(a:i,  X2)}  is  a sequence  of  asymmetric  functions  converging  to  an  asymmetric 
function  q(xi,  X2)  for  almost  all  (xi,X2)  such  that 


/ 7n(xi,  X2)t/>„(Xi,  X2)  dxr  dx2  = J 7(xi,  X2)  V'n(xi,  X2)  dxi  dx2  = 0. 

Dabrowska  (1990)  considered  the  same  sequence  of  contiguous  alternatives  in  her 
derivation  of  the  efficacies  of  the  WL-class  of  statistics. 

In  what  follows,  a parametric  sequence  of  alternatives  to  Ho  with  real  parameter, 
$,  will  be  specified  as  {V>(xi,  X2, : ^„)}  where  = $o  + for  some  constant  c > 0, 
and  the  density  under  the  null  hypothesis  is  a symmetric  density,  ip{xi,  X2,:  ^o)- 
Then  7(xi,  X2)  is  given  by 


7(xi,  X2)  = c—  log-tf^ixi,  X2  : 9) 


6=6o 


Without  loss  of  generality  we  will  assume  c = 1 because  in  the  expression  of  efficacy, 
c appears  in  the  numerator  and  denominator  and  therefore  efficacy  is  invariant  with 
respect  to  the  value  of  c. 

In  order  to  derive  the  efficacy  of  a statistic  in  the  PR-class,  the  distribution  of  T 
in  (3.2.1),  under  alternative  is  required.  By  applying  Le  Cam’s  third  lemma  (Hajek 
and  Sidak,  1967)  we  can  find  the  asymptotic  distribution  of  T under  contiguous 
alternatives.  Following  definitions  will  be  used  in  the  derivation  of  the  asymptotic 
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distribution  of  T . 


Aq  — 


and 


Po' 


oo  poo 


u 


7(xi,  S2)V’(^i)  X2)  dxi  dx2\  dG{u), 
Ai{t)  = j G{u) '</{u  - i,  u)ip{u  - t,u)  du, 

A2{t)  = [/2(0]~^  / G{u) 'riu,  u - t)ip{u,u  - t)  du, 

A3{t)  = [/3(f)]“^  j {j  7(w  - t,  J/)  V-Cu  - f,  y)  dy}  dG(«), 


>14(0  = ^ J {j'r{y,u-t)ij{y,u-t)dy]dG{u).  (3.3.2) 

Let  ; =1,2,3,4,  and  po«  denote,  respectively,  the  improper  densities  and 

the  probability  of  double  censoring  obtained  by  replacing  ^2)  with  i’ixi,  X2  ■ 0) 

in  (2.2.4)  and  (2.2.3)  . Then,  it  is  easy  to  see  that 


Aq  = — log  Poe  6 = Oq, 


and 


= la  ^ ~ ^ ~ 1,2,  3, 4. 

du 


For  example. 


/ { L L 


°°  /'°°  dlogip{xi,  X2  : 9) 


dO 


V>(xi,  X2, : ^0)  dx\  dx2  [ dG{u) 

6=0q  j 


= Po^o 


f°°  rp'{xi,  X2  : 0) 


i/>(xi,  X2) : 9o)  dx\  dx2  f dG{u) 
9=60  ^ 


u ^2  '•  9) 

— P060  w:i:  4>'{xi,  X2  : 9)  dxi  dx2}|^_g  dG{u) 

^ Pooo  / {/  / : 9)  dxi  dx2}  dG{u) 


d9 
dlog  Poe 
d9 


e=9o 


We  also  define  />«(•)>  i = 1,2, 3, 4,  and  pon  just  as  /,(•)  and  po,  except  that 
i/>(xi,  X2)  is  replaced  by  ^2),  as  given  in  (3.3.1).  The  following  theorem  gives 
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the  joint  distribution  of  T in  (3.2.1)  and  the  log-likelihood  ratio  : 


log 


dPn 

dP 


= E 

J = 1 


^^9  4^  ^ E Ci. 

fj{t)  Po 


(3.3.3) 


where  (i  = {1  — 5ii)(l  “ ^2i)- 


Theorem  3.3.1  Suppose  that  the  assumptions  of  Theorem  3.2.1  are  satisfied  and  that, 


as  n — >•  oo, 


j4on  — 2 \/n 


Po 


1/2 


n 


Po 


- 1 


^0) 


and 


J {Ajn{t)  - Aj{t)Y fj{t)  dt  0, 


where 


Ajn{t)  = 2y/n 


fjnjt) 

fj{t) 


1/2 


- 1 


i = l,2,3,4. 


Then,  under  the  null  hypothesis,  {log  converges  weakly  to  a normal  dis- 


tribution with  mean  {—all2,  0)  and  covariance  matrix 

^ Ct^ 

, Ct  j 
where  o\  is  defined  in  Theorem  3.2.1  and 

^0  = / + Al{t))h{t)dt  -f  J (^^(t)  + Al{t))f3{t)dt  -h  AIpI, 

Ct  = I J(1  - F{t)){Ai{t)  - A2{t))dt  + j Lc  (^3(<)  - A^{t))dt. 

Proof.  To  derive  the  joint  distribution  of  log  and  T( -^/n,  note  that,  by  Le  Cam’s 
second  lemma  (Hajek  and  Sidak,  1967),  it  is  enough  to  consider  the  joint  distribution 


of  T ! ^/n  and  Kni  where 


Kr.  = 


1 

\/n 


^ [ A„.{t)dN^{t)  + Aon  E C. 

_j=i  i=i 


(3.3.4) 
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From  Theorem  3.2.1,  we  know  that  asymptotic  distribution  of  T is  the  same  as 
the  asymptotic  distribution  of  T\  defined  in  Lemma  3.2.2.  Therefore,  we  will  consider 
the  joint  distribution  of  Tijy/n  and  Kn-  However,  to  derive  the  distribution  of  Kn, 


it  is  convenient  to  consider. 


= 


1 


I + ^0  ^ Ci 

i=i  . 


(3.3.5) 


Now,  we  proceed  to  derive  the  expectation  of  Sn-  We  have 


V^5„  = [ MtWAt)  + Ao  E C. 

j = l i=l 


n 


E 

t = l 


^2i 


+ (^1«  (1  ~ ^2»)  + (1  ~ ^It)  ^2t) 


+ >lo  (1  — ^It)  (1  ~ ^2i) 


2 

(1  + Ci) 


A3{\Zi\)  + ^ ^ j44(|Zi|) 


n 


= E#?;- 


(3.3.6) 


i=l 


Clearly,  /3*’s  are  i.i.d.  random  variables.  Under  Hq,  the  mean  of  /3*  is. 


E{f3:)  = E E[I3:  \ 6u  62i] 

= P[Su  = S2i  = l]E 


— ^2«  — 1 


+ P[6u  = 1,  62i  = 0]E 


(1  + €i) 


Az{\Zi\) 


+ (1^A4(|z,|)  ^1,  = 1,  ^2.  = 0 


+ P[6u  = 0,  S2i  = 1] 


(1  + ^i) 


Az{\Z.l) 


+ A,(|Zi|)  = 0,  = l 


+ P[^ii  = S21  = 0]  P Ao  Su  = 821  — 0 


(3.3.7) 
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Under  Ho,  Fi(0)  = ^2(0)  and  P[6u  = 821  = 1]  = 2Fi(0).  Therefore  the  first  term  on 
the  right  hand  side  of  (3.3.7)  simplifies  to, 


2Fi(0)  E 


— ^2t  — 1 


= 2Fi(0)  P{ti  = 1 \ 6u  = 821  = 1,  = 1]  E[Ai{\Zi\)  I 8u  = 82i  = 1,  ti  = 1] 

+ 2F2(0)  P[ei  = —1  = ^2i  = 1,  €i  = —1]  X 

E[A2{\Zi\)  8u  = 82i=l,  €i  = -l] 


= 2F.(0)i  / .4,(0  + 2F,m\  / M(t)  II  dt 

= J .^1(0  /l(0  ^ j f2{t)  dt. 

Similar  calculations  can  be  done  for  other  terms  and  we  can  write, 

EH3*)  = j Ai(t)  hit)  dt  + J A2{t)  f2{t)  dt 

+ j Ash)  foit)  dt  + j A^it)f^it)dt  + AoPo- 


After  some  simplification,  it  can  be  shown  that 


F(/?;)  = 0. 


(3.3.8) 


Now  consider, 


Eifif)  = E E[!3f  \ 8u  82r] 

= P{8u  = 82i  = l]E 


(1  + e.) 


+ Mm) 


Mm) 


8u  = 82i  = I 
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+ P[S\i  — 1, 62*  — 0]E 


(1  + 


A,m) 


Su  = 1,  in  = 0 


+ — 0, 621  — l]E 


(1  + €i) 


A^m) 


+ — ^2t  — 0]  E 


A': 


Sti  = 0,  Sii  = 1 


= ^2»  — 0 


(3.3.9) 


Under  the  null  hypothesis,  the  first  term  on  the  right  hand  side  of  (3.3.9)  equals 


2Fi(0)E 


^i^Ai(IZil)  + il^A2(|Z.i) 


li  — ^2t 


= 1 


= 2Fi(0)P(€i  = l Su  = S2i  = l)E  Ali\Zi\)  6u  = S2i  = l,  €i  = l 


+2F2{0)P{ei  = -1  6u  = S2i  = \)E  Al(\Zi\)  Su  = 82i  = 1,  e.  = -1 


Similar  calculations  can  be  done  for  other  terms  in  (3.3.9)  to  obtain 

E(pf)  = 2F.(0)  i / m M M + 2F,(0)i  / Al(t)  ||  dt 

+ 2F3(0)  i / Al(t)  ^ 


= J {Al{t)  + >12(0)  /i(0  A J {Al{t)  + j44(t)^  fj  dt  + poAl 


Since  /3*’s  are  i.i.d., 


var{s/nSn)  = nvar{f3*). 


That  is, 
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var(Sn)  = var{)3*) 
= E(l3f) 


(Jq. 


Therefore,  under  Ho,  5„  is  a sum  of  i.i.d.  random  variables  with  mean  zero  and 
variance  ctq,  so  that  by  the  central  limit  theorem. 


1 


y/n 


'n 


Asymptotically  Normal  I 0, 


0 


n 


Now,  the  next  step  is  to  find  E{Kn)-  By  definition, 

fln{t)  = J 1pn{u-t,t)G{u)du 

= j ij;{u  - t,t)  G{u)  du  + J 7„(tx  - t,t)  V>(u 

= /i(0  + f ~ 0 

y/vi  J 


— t,  t)  G{u)du 


= fi{t)  + hn{t),  say. 


(3.3.10) 


Let 


g{u)  = + u)5. 


Differentiating  g{u)  with  respect  to  u, 


y 1 1 

9{^)  = 


/(u)  = 


^ \//i(0  + “ 
1 1 


^ (\//i(0  + “)' 


Expanding  ^(u)  around  u = 0,  we  get. 


ti' 


g{u)  = g{0)  + ug'(0)  + ^ /(O)  + o(u^). 
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Then  using  (3.3.10)  and  the  fact  that  g{hn{t))  = \/ fin(t)  we  get 


\//in(0  = yflii)  + 


1 


+ 


1 


8^/i(t)/i(t)n 


u — t,t)  'ip{u  — t,  t)  G{u)du 


u — t,t)  'tp[u  — t,  t)  G{u)du 


+ 0{hl{t)) 


\j fl{t)  + 2 V “■  ^ \//l(0  + <^(^n(0)- 


Therefore,  using  the  definition  of  Ain{t)  from  the  statement  of  Theorem  3.3.1, 


Ai„(t)  = 2>/n 


V /i(o ; 


= Ai{t)  - 442  + o(n~2). 


A^/n 


Similarly,  it  can  be  shown  that. 


Ajn{t)  = Aj{t)  - + o{n  0, 


j=  2,3,4, 


(3.3.11) 


and 


j\2  ^ 

^on(t)  = Ao  — ■%  + o(n  2). 

Ay/n 


(3.3.12) 


Following  the  technique  used  in  the  derivation  of  expectation  of  Sn,  (3.3.11)  and 


(3.3.12)  we  have 


E{Kn)  = 


f + ^OPO 

>=1 


1 

4 


^oPo 

i=i 


+ ‘>(1) 


= j ^On  PO 


2 

4 ^0- 
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Now  notice  that 


n 


V^(X„-5„)  = 


t=l 


^2i 


(1  + €i) 


{AU\Zi\)  - M{\Zi\)) 


+ ^^^(^n(|Zi|)-  -42(12^1)) 


+ (^It  (1  ~ ^2»)  + (1  “■  ^2t) 


(1  + Ci) 


(A3„(|Z.|)-A3(|Z.D) 


+ ^^^(-4.„(lZ,|)->l4(|Z-l)) 

+ Ao  (1  — ^It)  (1  — ^2t)] 


n 


= E/5.' 

i=l 

2 

Following  the  technique  used  in  the  derivation  of  £!(/?*  ) we  can  show  that 


»or(A')  < B(ft) 
= 


(-^On  ~ Po*  (3.3.13) 


i=i 


From  (3.3.13),  it  can  be  shown  that, 


var 


{K„  — Sn)  < E / ~ + ("^On  — >lo)^  Po 

i=i 


0. 


Therefore,  under  the  null  hypothesis,  K„  is  asymptotically  normal  with  mean  — 


4 


and  variance  Uq.  We  can  use  Le  Cam’s  second  lemma  to  complete  the  proof  of  the 


asymptotic  normality  of  the  log-likelihood  log^ , that  is 


log 


dPn 

dP 


r\^ 


Normal  ( — -<7’o>  "^o)- 


(3.3.14) 


By  following  a procedure  similar  to  that  used  for  derivation  of  the  mean  and 
variance  of  5„,  we  get. 


II 

y/n 


= J Lu  J {I  — Fu)  ~ f lil)  dt  -f  J Lc  {As{t)  — f3{t)  dt 


= CT- 


cov 
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To  complete  the  proof  of  this  theorem,  it  is  required  to  show  the  joint  asymptotic 
normality  of  Tily/n  and  5„.  From  (3.2.12)  and  (3.3.6)  we  have  : 

Tr  = EA, 

1 = 1 

= Y.K- 

t = l 

Therefore,  for  real  a\  and  a2, 

n 

a-iTi  + a2\/nSn  = Y^{ai(3i  + a20*) 

1 = 1 

is  a sum  of  i.i.d.  random  variables.  We  have  already  derived  the  expectations, 
variances  and  covariance  of  T\  and  y/nSn-  Since  a\Ti  + a2y/nSn  is  a sum  of  i.i.d. 
random  variables,  the  asymptotic  normality  of  aiT\-\-a2y/nSn  follows  from  Lindeberg- 
Levy  central  limit  theorem.  For  real  oi  and  02,  the  mean  and  variance  are 


E{aiTi  + a2\/nSn)  = oiO  + 0.2O 

= 0, 

and 

var{aiTi  + <i2\/nSn)  = a\na\:  + a\naQ  + 2a\a2txcT.  (3.3.15) 

Now  applying  the  Cramer- Wold  device,  we  have  the  joint  asymptotic  normality 
of  Tily/n  and  5„.  That  is,  {Tijy/n,  5„)  is  asymptotically  normal  with  mean  (0,  0) 
and  covariance  matrix, 

/ \ 

<^1  Ct 

y Ct  ctj  ) 

This  completes  the  proof  of  the  Theorem  3.3.1. 

As  a consequence  of  Le  Cam’s  third  lemma  (Hajek  and  Sidak,  1967),  the  statistic 
Tly/n  converges  weakly  to  a normal  distribution  with  mean  Ct  and  variance 
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Therefore,  the  efficacy  of  the  PR-class  of  statistics  is  given  by 


The  next  section  is  devoted  to  using  the  efficacies  to  derive  expressions  for  optimal 
statistics  in  the  PR  and  WL-classes. 


3.4  The  form  of  Optimal  Statistic 


This  section  is  devoted  to  finding  statistics  in  the  PR  and  WL-classes,  such  that 
they  have  the  highest  efficacies  in  their  classes. 

Notice  that,  like  the  PR-class  of  statistics,  the  WL-class  of  statistics  can  also  be 
represented  as  linear  combinations  of  two  statistics  : 


Twl  = I Ml  - S{t))  d{Nr{t)  - N^it)) 

-F  J L2c  Jc(1  - 5(t))  d{N^{t)  - N,{t)) 

= L2JT2U  + L2cT2ci 


(3.4.1) 


where  the  score  functions  J„(-)  and  Jc(-)  satisfy  the  relationship 

Uv)  = {(1  - v)  JM)-  (3.4.2) 

The  statistics  T2U  and  P2c  corresponding  to  WL  Sign  statistic  are  denoted  by  T,u 
and  T,c  respectively.  Similarly,  statistics  T2u  and  T2C  corresponding  to  WL  Wilcoxon 
signed  rank  statistic  are  denoted  by  T^u  and  T^c-  Also,  notice  that,  Tu  given  in 
(3.2.4)  is  identical  to  T,c. 

In  what  follows,  a statistic  with  maximum  efficacy  in  a certain  class  is  called  an 
optimal  statistic  for  that  class.  In  case  an  optimal  statistic  is  a linear  combination 
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of  two  statistics,  the  corresponding  weights  (coefficients)  in  the  liner  combination  are 
called  optimal  weights. 

Theorem  3.4.1  is  adapted  from  Theorem  10.2.3  in  Serffing  (1980).  This  theorem 
gives  an  expression  for  the  efficacy  of  an  arbitrary  linear  combination  of  two  statistics. 
The  theorem  can  be  used  to  derive  the  optimal  statistics  in  the  PR  and  WL-classes. 

Theorem  3.4.1  Let  T*„  and  Tj*„  be  two  statistics  for  testing  Hq.  Assume  that 
and  satisfy  Pitman  Conditions  given  in  10.2.1  in  Serffing  (1980),  and  are  in 
standardized  forms.  Then  for  0 < Ai,  A2  < 1,  statistics  of  the  form. 


Txux.  = + A2T2*„,  0 < Ai,  A2  < 1 

satisfy  Pitman  conditions.  The  efficacy  of  is  given  by 


(3.4.3) 


where 


Cl  = (efficacy  of  Tin)  C2  = (efficacy  of  T2„)  5 


and 


Proof.  See  Serffing  (1980). 

Notice  that  if  Ai  + A2  7^  0,  then  in  Theorem  3.4.1  is  equivalent  to 

rA  = (i-A)Ti*„  + Ar;„,  (3.4.4) 

where, 


Ai  + A2 


The  efficacy  of  T\  is  given  by 
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eff{Tx) 


[ilzM  + A]2 

[{1  - xy +\^  + 2{i  - \)Xpy 


where  Ci,  C2  and  p are  given  in  Theorem  3.4.1. 

The  following  theorem  shows  that  p = 0 for  statistics  in  the  PR  and  WL-classes. 


Theorem  3.4.2  Let 

TpR  = + LicT*^, 

and 

'^WL  — ■^2u2^2u  + L2CT2*, 

denote,  respectively,  statistics  in  the  PR  and  WL-classes.  Then  under  the  null  hy- 
pothesis of  bivariate  symmetry. 


lim  corr{T^^,  = 0, 


n— »oo 


and 


lim  corr(T2*,  T^*)  = 0. 


n— ♦oo 


(3.4.5) 


Proof.  First  we  consider  the  PR-class  of  statistics.  Popovich  (1983)  proved  that  Tiu 
and  Tic  are  conditionally  independent  given  = n„,  Nc  = ric  that  is. 


cov{T*^,  T*^  iVu  = n„,  Nc  = ric)  = 0. 


Again,  under  Ho,  we  have 


I Nu  = riu,  Nc  = nc]  = 0, 

E[T*c  I Nu  = n„,  Nc  = nc]  = 0, 


(3.4.6) 


so  that 


cov  [£^(r*„  I AT„  = n„,  Nc  = Uc),  E{T*^  AT„  = n„,  Nc  = nc) 


0.  (3.4.7) 
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Therefore,  from  (3.4.6)  and  (3.4.7)  and  the  fact  that  for  random  variables  X,  Y and 

Z, 


cov{X,  Y) 


E 


cov{X,  y\Z)]+  cov  [e{X  I Z),  E{Y  \ Z) 


it  follows  that 

cov(r-„.  Tl)  = 0. 

Thus,  /9  = 0 for  the  PR-class  of  statistics. 

Now  we  consider  the  WL-class.  Recall  from  Section  2.2, 

m = E PAt)- 

i=i 

Notice  that  Tj*  and  Tj*,  are  standardized  versions  of  and  respectively, 
where, 

TL  = J - nmmt)  - N2(t)) 

= =E£dW„(l-f(|2.|))l 

i=l 

= (3.4.8) 

1=1 

and 

TL  = j J,(i  - ntMNM  - 

= =X:£,1(1-«.)J,(1-F(|Z,|))| 

i=l 

= Efe.  (3.4.9) 

1 = 1 

Dabrowska  (1990)  showed  that 


^2u  — ^2u  — 
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and 


TL-T2C  = 0,(1), 


where  Tau  and  T2C  are  given  in  (3.4.1).  Therefore,  we  can  consider  instead  of 

Twl- 


As  noted  in  (3.2.13), 


E\ei  Si,  \Zi\]  — 0, 


under  Ho-  Straightforward  calculations  similar  to  those  at  (3.2.13)  gives. 


E[l3„i  Si,\Zi\]  = 0, 
E\l3ci  Si,  \Zi\]  = 0, 


and 


E[l3i^i/3cj  Si,Sj,  \Zi\,\Zj\]  = 0,  l<i,j,<n. 


Therefore,  the  conditional  covariance  of  Tj*  and  Tj*  is 


con [.£J(r2* , Tj*  Si,...  Sn,\Zi\,...  \Zn\) 


= 0, 


Using  the  same  argument  as  in  the  previous  case,  the  correlation  coefficient  between 
Tj*  and  Tj*  for  the  WL-class  can  be  shown  to  be  0 under  null  hypothesis. 

The  next  theorem  gives  an  expression  for  A in  Theorem  3.4.1  such  that  the  efficacy 
of  T\  is  maximum  when  the  asymptotic  correlation  between  Ti„  and  T2n  is  zero  under 
the  null  hypothesis. 


Theorem  3.4.3  Assume  conditions  of  Theorem  3.4.1  and  suppose  that. 


liin  corr{T*^,  = 0 


n-^oo 
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under  Hq.  Then  statistics  of  the  form  T\  attain  majcimum  efficacy  for 


A = 


Cl 


Cl  + C2 


The  maximum  efficacy  is  given  by 


eff{Tx)  = 


2 J 


Proof.  From  Theorem  3.4.1,  the  efficacy  of  T\  when  = 0 is, 

[(1-^)  J.  A]2 


'//(rx)  = 


i(i-A)^  + A’r 


Maximizing  eff{T\)  is  same  as  maximizing  log  eff{T\).  Now  differentiating  log 
ef f{T\)  with  respect  to  A and  equating  the  resulting  expression  to  0,  we  have 


Cl  - C2 


1 4A  -2 


(1  - A)c2  + Aci  2 1 + 2A2  - 2A 


= 0. 


(3.4.10) 


If  Cl  + C2  ^ 0,  solving  the  equation  in  (3.4.10)  we  get. 


A = 


Cl 


Cl  + C2 


Again  differentiating  the  left  hand  side  of  (3.4.10)  with  respect  to  A,  we  have 

-(ci  - C2)(ci  - C2)  -2(1  + 2A2  - 2A)  - (2A-l)(4A-2) 


[(1  — A)c2  + Aci]^ 

(Cl  - C2)^ 


[(1  — A)c2  + Aci]^ 
(Cl-C2)^ 


[1  + 2A2  - 2A]2 
2[(1  + 2A2  -2A)  - (2A  - 1)^] 
[1  + 2A2  - 2A]2 
4A(1  - A) 


[(1  - A)c2  + Aci]2  [1  + 2A2-2A]2 


< 0, 


showing  that  ef f{T\)  is  maximized  at 


A = 


Cl 


i 


C\  + C2 
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The  meiximum  efficacy  is  given  by 


eff{Tx) 


-£2_ 


+ 


Si. 


(Cl+C2)ci  (Cj-fC2  )C2  ■ 


{^)  + ( 


Si. 


Ci4-C2 


1 1 


'2. 


where  Ci  and  C2  are  given  in  Theorem  3.4.1. 


Listed  below  are  the  variances  and  efficacies  of  the  various  statistics  considered 
thus  far.  These  expressions  can  be  derived  from  Theorem  3.2.1  and  Theorem  3.3.1 
and  will  be  needed  later  in  this  manuscript.  Recall  that  F{t)  appearing  in  some  of 
the  expressions  is  defined  as 

J = 1 

First,  to  consider  the  PR-class  of  statistics,  we  need  the  expressions  of  variances 
and  efficacies  of  Tu  and  Tiu  which  are  given  by 


var{Tic)  = 2 / fz{t)dt 


eff{Tu)  = 

var(Tiu)  = 


= 2 


2 ^3(0), 

[/(^3(^)  - A4{t))f3{t)dt]'^ 

2 F3(0) 

2 I (1  - fi{t)dt 

J \ Fx(0) 


fi{t)dt 


efm.)  = 


3 ^i(O), 

[/(I  - Fi(t))  {Ai{t)  - A2{t))fi{t)dt] 

i Fi{0) 


(3.4.11) 


Recall  that  Tic  is  identical  to  T,c-  Therefore  to  consider  WL  Sign  statistic  we  need 
the  expressions  of  variance  and  efficacy  of  in  addition  to  those  for  T,c- 


var{T,u) 


j fi{t)dt, 


e//(T,„)  = 


[J  {Ai{t)  - A2{t))  fi{t)  dt^ 
2 J{l-F{t)y  hit)  dt 


(3.4.12) 


Lastly,  for  WL  Wilcoxon  sign  statistic,  we  require  the  expressions  of  variance  and 
efficacy  of  T^u  and  T^c  which  are 


var{T^uti) 

e//(7’u,u) 


2 j {1  - F[t))^  hit)  dt 

U jAht)  - A^jt))  jl  - Fjt))  hit)  dt]^ 

2 fil-Fit))^hit)dt 


var{  T„,c ) = 

eff{T^)  = 


2 / 

[/  (.4,(0  - A,(t))  (1  - \F(t))  h(t)  dt]’^ 
2 / (1  - iF(())>  h(t)  dt 


(3.4.13) 


In  the  next  section,  the  efficacies  of  selected  members  in  the  PR  and  WL-classes 
are  compared  with  the  efficacies  of  optimal  statistics  in  these  classes. 

3.5  Asymptotic  Relative  Efficiencies  of  Various  Statistics 

As  noted  in  Section  2.3,  the  model  considered  by  Woolson  and  Lachenbruch  (1980) 
is 

exp(A\)  = aV\W, 

expiX'h  = V2W, 


(3.5.1) 
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where  X[  = log  Xi,  t = 1, 2,  a > 0 is  an  unknown  parameter,  Vi  t = 1,2,  are 
independent,  identically  distributed  non-negative  random  variables  and  W is  a non- 
negative random  variable  independent  of  the  V^.  From  (3.5.1), 

X'^  = log  a + log  Vi  + log  W, 

X'^  = logV2  + logW.  (3.5.2) 

We  can  rewrite  the  equations  in  (3.5.2)  in  the  form  : 

A\  = $ r/i  -{■  g,, 

X2  = V2  + 1^, 

where  7/1,  t]i  are  independent,  identically  distributed  random  variables  and  is  a 
random  variable  independent  of  the  rji. 

In  this  section  we  present  the  ARE  of  various  statistics  under  the  assumption  that 
the  density  of  r]i  is  the  extreme  value  density  given  by 

fr,i{u)  = —00  <u  < 00,  t = 1,2,  (3.5.3) 

and  that  of  g is  extreme  value  density  given  by 

y^(u)  = —00  <u<  00,  —00  < < 00,  t > 0. 

Assuming  extreme  value  distributions  for  rg,  i = 1 and  2,  we  get  logistic  dis- 
tribution for  X'2  — X[.  This  is  very  reasonable  because  the  behaviour  of  random 
variables  with  logistic  distribution  is  close  to  that  of  random  variables  with  normal 
distribution. 

Furthermore  if  C = log  C where  C is  the  censoring  time,  we  assume  that  the 
density  of  C is  also  extreme  value  density  given  by 

fc'{u)  = —00  < u < 00,  —00  < 02  < 00,  t > 0. 
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Note  that  under  the  above  assumptions,  the  correlation  coefficient  between 


and  X'2  is  given  by 


P = 


covjXl  X'2) 
\Jvar{X[)  uar(Ai) 


IT 

6? 


SL  4.11 
^ ' 6 

1 

t2  + 1’ 


(3.5.4) 


which  approaches  1 as  r — ^ 0.  Under  the  null  hypothesis  of  symmetry,  probability  of 
a doubly  censored  observation  is  given  by 


Po 


= P[X[>C\X'2>C’] 


= P[t7i  -\rjji>C',r]2-\-p>C] 


P[t7i  > C'  - ll,  T]2  > C'  - jx] 

^gr(z-a2+«i) 


/oo 
-00 


-2c 


(1  + 


dz. 


Therefore,  the  parameters  r,  ai  and  ct2  determine  the  heaviness  of  censoring. 
Correlation  coefficient  p depends  on  t only.  For  a fixed  value  of  t,  that  is,  for  a fixed 
value  of  p,  probability  of  double  censoring  depends  on  the  values  of  ai  and  a2- 
The  case  of  p = 0 is  equivalent  to  the  independent  case  here. 


X;  = ^ + T7a, 

a;  = V2,  (3.5.5) 

where  the  distribution  of  r]i  i = 1,2,  are  give  by  (3.5.3).  Therefore,  under  Ho, 
probability  of  a doubly  censored  observation  is  given  by 

Po  = P[X[  > c' , x’2  > C] 

= r dc. 

J — OO 
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For  the  purpose  of  efficacy  calculation,  the  parameter  r is  so  chosen  that  the 
correlation  coefficient  between  the  underlying  log-failure  times  is  p = 0,  0.10,  0.25, 0.50 
and  0.75.  Further,  for  each  of  these  values  of  p,  the  parameters  ai  and  02  are  selected 
in  such  way  that  under  the  null  hypothesis,  the  probability  of  a doubly  censored  pair 
is  equal  to  po  = 0.10,  0.25,  0.50  and  0.75.  The  following  statistics  are  considered  for 
numerical  comparison  of  their  efficacies. 

1.  PR  statistic  with  equal  weights  (PR-EQ). 

2.  PR  statistic  with  weights  proportional  to  the  standard  deviations  (PR- 
STD). 

3.  PR  statistic  with  optimal  weights  (PR-OPT). 

4.  WL  Sign  statistic  with  equal  weights  (S-EQ). 

5.  WL  Sign  statistic  with  weights  proportional  to  the  standard  deviations 

(S-STD). 

6.  WL  Sign  statistic  with  optimum  weights  (S-OPT). 

7.  WL  Wilcoxon  signed  rank  statistic  with  equal  weights  (W-EQ). 

8.  WL  Wilcoxon  signed  rank  statistic  with  weights  proportional  to  the  stan- 
dard deviations  (W-STD). 

9.  WL  Wilcoxon  signed  rank  statistic  with  optimal  weights  (W-OPT). 

Last  six  statistics  in  the  list  are  from  the  WL-class. 

Table  3.1-Table  3.4,  given  on  pages  51-54  contain  the  asymptotic  relative  effi- 
ciencies (ARE)  of  the  above  nine  statistics.  Tables  3. 1-3.4  can  be  used  to  calculate 
the  ARE  of  any  two  statistics  given  in  the  list. 
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The  figures,  Fig  3.1-Fig  3.5  on  pages  55-59  show  the  efficacies  of  the  nine  statistics 
when  p = 0,  0.10,  0.25,  0.50  and  0.75. 

From  Table  3.1,  in  the  independent  case  (p  = 0),  and  when  the  probability  of 
double  censoring  is  low,  S-EQ  is  almost  as  efficient  as  S-OPT.  The  loss  in  efficacy 
is  only  2.68%  when  po  = 0.10  and  p — 0.  But,  S-STD  is  performing  better  as 
the  probability  of  double  censoring  increases.  For  a fixed  value  of  po  and  non-zero 
correlation  coefficient  p,  performance  of  S-EQ  improves  with  increasing  value  of  p. 
But,  for  a fixed  value  of  po,  performance  of  S-STD  goes  down  with  increase  in  p.  If 
optimal  weights  cannot  be  used,  then  for  lower  values  of  po,  S-EQ  should  be  used. 
For  higher  values  of  po,  S-STD  is  preferable. 

In  Table  3.2,  in  the  independent  case,  W-EQ  has  high  efficiency  when  compared 
to  W-OPT  for  low  value  of  po.  For  po  = 0.10  and  p = 0,  the  loss  of  efficacy  resulting 
from  using  W-EQ  instead  of  W-OPT  is  only  1.25%.  As  po  increases,  loss  in  efficacy 
resulting  from  using  W-STD  instead  of  the  optimal  statistic  in  this  class,  W-OPT, 
is  less  than  that  resulting  from  using  W-EQ  instead  of  W-OPT.  When  po  is  low, 
either  W-EQ  or  W-STD  could  be  used  instead  of  W-OPT.  For  po  = 0.10,  efficacy 
loss  by  using  W-EQ  is  less  than  8%,  whereas  the  loss  never  exceeds  15%  if  W-STD  is 
used  instead  of  W-OPT.  In  general,  for  low  values  of  po  and  high  values  of  p,  W-EQ 
is  preferable  over  W-STD.  But  for  high  values  of  po  W-STD  is  performing  better 
irrespective  of  the  magnitude  of  correlation  coefficient. 

Comparing  statistics  belonging  to  the  PR-class  from  Table  3.3,  almost  always 
PR-STD  is  performing  better  than  PR-EQ.  The  only  exceptions  are  when  po  = 0.10 
and  correlation  coefficient  is  either  very  low  or  very  high.  In  these  cases,  PR-EQ  is 
almost  as  efficient  as  PR-OPT.  Particularly  for  po  = 0.10,  by  using  PR-EQ  instead 
of  PR-OPT,  the  loss  in  efficacy  is  less  than  7.26%.  In  general,  the  use  of  PR-STD 
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is  recommended.  The  loss  in  efficacy  resulting  from  the  use  of  PR-STD  instead  of 
PR-OPT  is  never  more  than  22%. 

From  the  Table  3.4,  one  can  see  that  when  /?  = 0,  W-OPT  is  more  efficient  than 
the  other  optimal  statistics.  All  three  statistics  compared  in  this  table  have  efficacies 
within  1%  of  each  other  for  high  value  of  po  in  the  independent  case.  In  general, 
efficacy  of  PR-OPT  is  slightly  higher  than  the  efficacy  of  optimal  statistics  in  the 
WL-class. 

By  examining  the  efficacies  of  the  nine  statistics  given  in  Tables  3. 1-3.4  and 
displayed  in  Figures  3. 1-3.5,  it  is  evident  that  the  members  of  the  PR-class  are  per- 
forming very  well  in  almost  every  case.  Also,  for  small  sample  sizes,  exact  test  can 
be  performed  for  members  of  this  class  of  statistics.  When  construction  of  optimal 
statistics  is  not  possible,  depending  on  heaviness  of  censoring  and  degree  of  associa- 
tion in  the  pairs,  decision  has  to  be  made  about  using  PR  statistic  with  equal  weights 
or  with  weights  proportional  to  standard  deviations.  In  general,  when  the  probabil- 
ity of  double  censoring  is  low,  PR-EQ  is  recommended  but  for  higher  percentage  of 
doubly  censored  pairs  in  the  sample,  PR-STD  is  preferable. 
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Table  3.1.  ARE  of  S-EQ  and  S-STD  with  respect  to  S-OPT  (denoted  by  ARE  (i) 

and  ARE  (ii)). 


P 

Po 

ARE 

■ 0 

0.10 

0.25 

0.50 

0.75 

0.10 

(i) 

0.97313 

0.88831 

0.95639 

0.98947 

0.99181 

(ii) 

0.81839 

0.78922 

0.81070 

0.79269 

0.71277 

0.25 

(i) 

0.71128 

0.63964 

0.72288 

0.80301 

0.84046 

(ii) 

0.76117 

0.70965 

0.68951 

0.64460 

0.55690 

0.50 

(i) 

0.62495 

0.53139 

0.56235 

0.61371 

0.65178 

(ii) 

0.82248 

0.79317 

0.73470 

0.63457 

0.50993 

0.75 

(i) 

0.53421 

0.50480 

0.51459 

0.53496 

0.55072 

(ii) 

0.90504 

0.89858 

0.83849 

0.71376 

0.55388 
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Table  3.2.  ARE  of  W-EQ  and  W-STD  with  respect  to  optimal  W-OPT  (denoted  by 

ARE  (i)  and  ARE  (ii)). 


P 

Po 

ARE 

0 

0.10 

0.25 

0.50 

0.75 

0.10 

(i) 

0.98753 

0.91985 

0.97963 

0.99937 

0.99995 

(ii) 

0.92778 

0.91777 

0.93173 

0.91353 

0.84253 

0.25 

(i) 

0.73659 

0.66074 

0.75327 

0.83904 

0.87781 

(ii) 

0.86574 

0.81672 

0.80806 

0.77266 

0.68672 

0.50 

(i) 

0.63777 

0.53536 

0.56965 

0.62618 

0.66746 

(ii) 

0.87550 

0.84018 

0.79093 

0.70412 

0.58363 

0.75 

(i) 

0.53630 

0.50513 

0.51550 

0.53694 

0.55313 

(ii) 

0.91808 

0.91065 

0.85567 

0.73895 

0.58046 
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Table  3.3.  ARE  of  PR-EQ  and  PR-STD  with  respect  to  PR-OPT  (denoted  by  ARE 

(i)  and  ARE  (ii)). 


P 

Po 

ARE 

0 

0.10 

0.25 

0.50 

0.75 

0.10 

(i) 

0.99186 

0.92736 

0.98152 

0.99919 

0.99968 

(ii) 

0.98873 

0.98111 

0.98768 

0.97420 

0.91475 

0.25 

(i) 

0.74935 

0.66725 

0.76041 

0.84265 

0.87839 

(ii) 

0.95456 

0.91792 

0.92255 

0.90312 

0.83178 

0.50 

(i) 

0.65135 

0.53889 

0.57577 

0.63488 

0.67670 

(ii) 

0.96269 

0.92936 

0.91110 

0.87019 

0.78852 

0.75 

(i) 

0.54435 

0.50647 

0.51836 

0.54223 

0.56007 

(ii) 

0.97407 

0.96503 

0.94383 

0.89282 

0.80241 

54 


Table  3.4.  ARE  of  PR-OPT  with  respect  to  W-OPT  and  S-OPT  (denoted  by  ARE 

(i)  and  ARE  (ii)). 


P 

Po 

ARE 

0 

0.10 

0.25 

0.50 

0.75 

0.10 

(i) 

0.98112 

0.99321 

1.01768 

1.03522 

1.04729 

(ii) 

1.12185 

1.07298 

1.10958 

1.13946 

1.13924 

0.25 

(i) 

0.97441 

1.00542 

1.02191 

1.04029 

1.05422 

(ii) 

1.02119 

1.00917 

1.02238 

1.03888 

1.04759 

0.50 

(i) 

0.98607 

1.00782 

1.01435 

1.02457 

1.03339 

(ii) 

1.00777 

1.00053 

1.00188 

1.00554 

1.00896 

0.75 

(i) 

0.99598 

1.00302 

1.00539 

1.00927 

1.01148 

(ii) 

1.00080 

1.00002 

1.00012 

1.00056 

1.00105 
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(a)  po  = 0.10;  (bl  po  = 0.25; 
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Fig  3.4.  Efficacy  Comparison  for  p = 0.50 

(a)  po  = 0.10;  (bi  po  = 0.25; 

(c)  Po  = 0.50;  (d)  Po  = 0.75; 
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Fig  3.5.  Efficacy  Comparison  for  p = 0.75 

(a)  po  = 0.10;  (b)  po  = 0.25; 

(c)  Po  = 0.50;  (d)  Po  = 0.75; 


CHAPTER  4 


ESTIMATION  OF  OPTIMAL  WEIGHTS 

4.1  Introduction 

In  Chapter  3,  we  presented  a comparison  of  the  efficacies  of  statistics  in  the  PR 
and  WL-classes.  We  noted  that,  depending  upon  the  heaviness  of  double  censoring 
and  the  correlation  between  the  survival  times,  optimal  statistics  can  have  very  high 
efficacy  compared  to  other  members  in  a class. 

Since  the  weights  needed  to  compute  the  optimal  statistics  depend  on  the  un- 
known joint  distribution  of  the  observed  survival  times  in  a complicated  manner, 
practical  use  of  these  statistics  requires  reasonable  estimates  of  the  optimal  weights. 
In  this  chapter  we  describe  a method  of  performing  tests  based  on  optimal  statistics 
with  estimated  weights.  In  Section  4.2,  a model  is  proposed  for  the  conditional  dis- 
tributions of  the  \Zi\.  Section  4.3  has  theorems  needed  to  justify  the  application  of 
the  proposed  model  to  estimate  the  optimal  weights  while  Section  4.4  has  some  of 
the  estimates  of  optimal  weights  for  selected  distributional  forms.  Lastly,  Section  4.5 
has  the  forms  of  statistics  with  estimated  optimal  weights. 

4.2  Modelling  Conditional  Distributions 


It  is  seen  from  Theorem  3.4.3  that  the  weights  associated  with  optimal  statistics 
depend  upon  the  joint  distribution  of  (i^i,  1^2)  in  a complicated  way.  Specifying 
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realistic  joint  distributions  for  (Fi,  Y2)  is  difficult  because  such  a specification  involves 
the  joint  distribution  of  (Xi,  X2)  and  C.  In  this  chapter,  we  will  construct  estimates 
of  optimal  weights  assuming  the  following  model  for  the  conditional  distributions  of 

\Zi\  given  i G Bj,  j = 1,2, 3, 4. 

Consider  the  following  model  for  the  conditional  distribution  of  \Zi\  given  z G 
j = 1,2, 3,4. 


P{\Zi\>t\ieBi)  = 2[1 
P{\Zi\>t\ieB2)  = 2[1 
P{\Zi\>t\ieB3)  = 2[1  - 
P{\Zi\  >t  ieBi)  = 2[1  - H{tuO)], 


(4.2.1) 


where  H{-)  is  a c.d.f.  of  a random  variable  symmetric  about  zero. 

The  model  in  (4.2.1)  is  a reasonable  model  from  a practical  point  of  view  for  two 
reasons.  First  of  all,  the  observed  data  {\Zi\  z G Bj},  for  j = 1, 2, 3, 4,  can  be  used  to 
check  the  adequacy  of  the  model  for  various  choices  of  H.  A test  of  fit  or  a graphical 
approach  such  as  q-q  plot  can  be  used  for  this  purpose.  Secondly,  the  model  not  only 
possesses  a simple  structure,  but  also  is  flexible  enough  to  represent  a wide  variety  of 
distributional  forms.  Some  important  characteristics  of  the  model  are  listed  below. 


Under  Ho, 


{\Zi\  z G Si}  4 {\Zi\  z G B2} 


and 


{\Zi\  i G S3}  = {\Zi\  z G S4}, 


SO  that  9 = 1 corresponds  to  the  null  hypothesis.  Note  that  {\Zi\  i ^ B\\  and 
{\Zi\  z E B3}  need  not  be  the  same  even  under  Hq-  The  same  is  true  for  {\Zi\  z G B2} 
and  {\Zi\  i G 54}.  Thus,  the  model  accounts  for  the  fact  that  under  random 
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censoring,  the  distribution  of  \Zi\  from  uncensored  pairs  can  be  different  from  the 
distribution  of  jZij  from  singly  censored  pairs. 

If  A"  is  random  variable  with  a density  /(•),  symmetric  about  0 on  (—00,  00), 
then  the  density  of  |A|  is  2/(-)  on  [0,  00).  In  what  follows,  we  shall  refer  to  2/(-)  as 
the  folded  distribution  of  X.  The  distributions  in  (4.2.1)  are  the  folded  distributions 
of  random  variables  with  survival  distribution  2[1  — 

Fig  4.1  and  Fig  4.2  in  pages  82-83  represent  the  density  functions  corresponding 
to  standard  normal  distribution  and  a folded  standard  normal  distribution. 

From  the  model  in  (4.2.1),  the  subsurvival  distributions  of  {\Zi\,  i G Bj},  j = 
1,2,3  and  4 can  be  obtained  as 

Fre{t)  = 2Fre{0)[l-H{t)], 

F20{t)  = 2F2e{0)[l-H{t$)], 

F^eit)  = 2F30{O)  [1  - 

F^e{t)  = 2F49{0)[1-  H{tu0)].  (4.2.2) 

Notice  that  under  Ho  the  subsurvival  functions  are  denoted  by  Fj{t).  The  improper 
densities  corresponding  to  (4.2.2)  are  given  by 

Mt)  = 2Fie{0)h{t), 

f2e{t)  = 2F20{O)  e h{t0), 

f3e{t)  = 2F3e{Q)  v 

Ueit)  = 2F4e{0)ueh{tue).  (4.2.3) 

The  conditional  distribution  of  Zi  given  that  it  is  from  an  uncensored  pair  is  as 


follows. 
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P{Z,  > t 


/ 


I G B\  U B2)  — ^ 


V 


P(Zj>t,  i€Bi) 
F(i€BiUB2) 


. __  P(Zi>t,i^B2) 
^ P(t€BiUS2) 


if  f > 0 
if  ^ < 0 


f > ( I . € B.) 


. P(i€fli)+P(i€uej)^(l^’l  — * * £ B2) 


if  t > 0 
if  f < 0 


P(t€Bi) 

P(i^Bi)+P(ieBi) 


2[1  - H{t)] 


P(»€B;) 

P(i€Bi+P(ieB2) 


2[i  - H{-te)] 


if  < > 0 


if  < < 0. 


Thus,  the  conditional  density  of  Z,  given  the  difference  is  uncensored  is  given  by 


f{t  t G U B2)  — < 


B(»€Bi) 

B(teBi)+P(«6Bj) 


2h{t) 


if  t > 0 


. P(i6B!)+P(i€B2)  ^ if  ^ < 0 


Fi9(0) 

^ia(0)+-^2«(0) 


2h{t) 


if  t > 0 


^2e{p)  , 

^l«(0)+P2^(0) 


6 2h{—t6)  if  t < 0. 


(4.2.4) 


When  ifo  * ^ = 1 is  true,  (4.2.4)  represents  a symmetric  distribution.  Fig  4.3  on  page 
84  shows  the  density  of  an  uncensored  Zi  when  /i(«)  is  the  standard  normal  density 
and  6 = 1.1.  Notice  that  there  is  a point  of  discontinuity  at  Zi  = 0,  but  c.d.f.s  of 
{|Zi|,  i G Bj}^  j = 1,2,3  and  4 are  continuous. 

The  conditional  distribution  of  the  censored  differences  can  be  obtained  in  a 
similar  way.  The  density  function  is  given  by 
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f{t  z € 5s  U B4)  = < 


if  < > 0 


if(<o 


P{i^Bz  )+P(i€S4) 


= < 


(4.2.5) 


Recall  from  Theorem  3.4.1,  that  optimal  weights  depend  on  the  following  five 


integrals, 


rco 

7i  = / {Ai{t)  - A2{t))  fi{t)dt, 

Jo 

= f {A^it)  - A4(<))  fz{t)dt, 

Jo 

h = I"  {Ar{t)  - A^it))  fl  - 1 fi{t)dt, 


L. 


m) 


h = I {Ar{t)-A2{t)){l-F{t))fr{t)dt, 

Jo 

I,  = j"{A,{t)-A,{t))(l-\F(t))  Mt)dt. 


where  Aj{t)  are  defined  in  (3.3.2)  and 

m = E ^(1^*1  > ^ e 

i=i 

Using  (4.2.3)  in  the  definitions  of  Aj{t),  we  get. 


j4i(t)  — A2{t)  — 


d 

de 

d 


[log  fieit)  - log  f2e{t)] 

FieiO) 


e=i 


log 


F2e{0) 


— log  6h{tB) 


e=\ 


and 


(4.2.6) 


(4.2.7) 


Asit)  — A4{t)  — 


^ [log  f3e{t)  - log  f4e{t)] 


e=i 


d 

dS 


log  h{tu0) 


e=i 


(4.2.8) 
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Thus,  in  order  to  evaluate  the  integrals  in  (4.2.6),  in  addition  to  the  form  of  h{-),  we 
need  to  specify  the  derivatives  of 


at  ^ = 1. 
Now, 


F^ejO) 

F^eiO) 


FiejO) 

F20{O) 


Fi${0) 


F2e{0) 


Fie{0)  + i^2e(0)  ' Fig{0)  + i^2^(0) 


F3,?(0) 

F^giO) 


F^eiO) 


F^O) 


F3g(0)  + F4s(0)  ^ .^3^(0)  + i^4s(0) 


(4.2.9) 


(4.2.10) 


Therefore,  (4.2.9)  represents  the  odds  ratio  of  observing  a positive  difference  with 
respect  to  a negative  difference  given  that  the  difference  corresponds  to  an  uncensored 
pair.  Similarly,  (4.2.10)  represents  the  odds  ratio  of  observing  a positive  difference 
with  respect  to  a negative  difference  given  that  the  difference  comes  from  singly 
censored  pair.  We  will  refer  to  (4.2.9)  and  (4.2.10)  as  the  odds  ratio  for  uncensored 
and  censored  differences  respectively. 

Let  g{')  be  a function  such  that  g(0)  is  differentiable  at  ^ = 1 and  g'(l)  = 0.  We 
assume  that  the  odds  ratios  at  (4.2.9)  and  (4.2.10)  have  the  forms. 


Fig{0) 

F2g{0) 


= exp{g{9)), 


FzeiO) 

F^eid) 


exp{i/g{0)). 


(4.2.11) 


Note  that,  the  forms  for  the  odds  ratios  specified  in  (4.2.11)  imply  that  ^'(1)  and 
i^g'{l)  are  the  rates  at  which  these  ratios  are  changing  in  the  neighborhood  of  Hq  : 
^ = 1.  Also,  the  parameter  u used  in  (4.2.11)  is  same  as  the  one  used  in  (4.2.1). 
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It  follows  that  the  optimal  weights  under  (4.2.1)  and  (4.2.11)  depend  upon,  h{-), 
5'^(1)  and  u.  Since  2h{-)  represents  the  conditional  density  of  \Z\  under  Hq,  any 
density  on  [0,oo  is  a possible  candidate  for  2/i(-).  For  example,  the  exponential, 
folded  standard  normal,  folded  logistic  etc.  are  suitable  candidates.  The  value  of  v 
can  be  consistently  estimated  from  the  data,  by  noting  that 


1/  = log 


FjeiO)  / Fie{0) 

F,e{0)  ' F,,{^y 


(4.2.12) 


Since  F),'^(0),  j = 1,2, 3, 4,  can  be  consistently  estimated  by 


Afl(O)  = A^u+  / 

A^(0)  = AT„_  / n, 

FzeiO)  = Nc+  / n, 

Ao(0)  = iV^_  / n,  (4.2.13) 


a consistent  estimate  of  u is  given  by 

_ - logK- 

logNu+  — logNu- 


(4.2.14) 


The  value  of  ^'(1)  needs  to  be  specified  by  the  user  based  on  the  desired  rate 
of  departure  of  the  alternatives  from  the  null  hypothesis.  Higher  values  of  ^'(1) 
correspond  to  more  severe  departures.  Let  5-'(l)  = a and  we  will  refer  to  a as  the 
rate  of  departure  from  the  null  hypothesis.  Then  from  (4.2.11), 
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4.3  Evaluation  of  the  Integrals 


The  theorems  in  this  section  gives  expressions  for  7i  — I5  in  (4.2.6)  under  the 
assumption  that  (4.2.2)  and  (4.2.11)  hold.  The  expressions  are  given  in  terms  of  a 
parametric  folded  density  function  /i(-),  the  rate  of  departure  a and  censoring  elfect 
constant  u.  In  Section  4.4,  actual  forms  of  h — h will  be  given  when  /i(-)  is  an 
exponential,  a folded  standard  normal  or  a folded  logistic  density  function. 

Theorem  4.3.1  Let  a = g^(l)  and. 


h(t)  dt. 

e=i  ^ ’ 


Under  (4.2.2)  and  (4.2.11), 


h 


Proof.  By  definition. 


too 

h = / (^(i)  - A2{t))  h{t)dt  = aFi(O)  + ^Fi(O), 

Jo 

poo 

= / (Asit)  - A4{t))  fz{t)dt  = avFz{Q)  + (dFsiO). 

Jo 


Ai{t)-A2{t)  = [log  fie{t)  - log  f2e{t)] 


e=i 


de 


e=i 


Therefore, 


==  a - ^ [log  9h{te)] 


9=1 


h = J (Ai{t)  - A2{t))  fi{t)dt 

= a F^{0)  - 2Fi(0)  j -^[log  Oh{te)]  h{t)  dt 


a Fi(0)  + 13  Fi(0). 
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Similarly,  we  have 


h = J {Asit)  - A4{t))  h{t)dt 

= ai/ ^3(0)  - 2F3(0)  J -^[loguehitiyO)]\^^_^uh{tp)dt. 


Notice  that 


— [/o5^  veh{tp0)] 


e=i 


dO 

d 

le 


[log  p] 


+ -^{log 


e=i 


[log  6h{tu6)] 


e-i 


(4.3.1) 


Using  the  result  in  (4.3.1)  and  substituting,  = tv  in  the  second  term  of  I21  we  get 


2F, 


/J 

-j^[log  v9h{tv0)] 


de 


e-i 


vh{t  v)  dt 


= 2Fs{0)  I ^[log0h{t'e)] 


e=i 


h{t')  dt' 


= 2Fs{0)(3. 


Therefore, 


F 


ap  F3(0)  + P F3(0). 


The  next  theorem  gives  as  expression  for  I3  in  (4.2.6). 

Theorem  4.3.2  Let  a and  P be  as  in  the  Theorem  4.3.1  and  7 be  defined  by 


7 


= 4 r ^[(03  eh{t»)\  [1  - h(t)  dt. 


Under  (4.2.2)  and  (4.2.11), 


h = {Apt)  - A2{t))  fl-^ 


(0) 


fi{t)dt 


a 


= -Fx(0)  + (/3  + 7)Fi(0). 
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Proof.  From  Theorem  4.3.1,  (4.2.2),  (4.2.3)  and  (4.2.8),  we  can  write, 

h = j {M{t)  - A2{t))  fi{t)dt  - J (Ai(0  - A2{t))  fi{t)dt 


poo 

= aFi(0)  + /3Fa(0)-4Fi(0)y^  [a  - ^[log  eh{tO)] 


9=1 


[1  - H{t)]  h(t)dt. 


Using  the  transformation  [1  — H{i)]  = x,  we  get 

/•1/2 

/a  = aFi(O)  + /3Fi(0)  - 4Fi(0)  / axdx 

Jo 


d 

+4Fi(0)  0h{t9)] 


9=1 


[1  --  H{t)]  h{t)dt 


1 poo  d 

= aFi(O)  + /3Fi(0)  - -aFr(O)  + 4Fi(0)  —[log  0h{te)]  - H{t)]  h{t)dt 


= -Fr(0)  + (/3  + 7)i"i(0). 

The  next  theorem  gives  expressions  for  I4  and  I5  in  (4.2.6). 

Theorem  4.3.3  Let  a,  f3  and  7 be  as  in  Theorems  4.3.1  and  4.3.2.  Under  (4.2.2)  and 
(4.2.11), 

U = r {Ar{t)-A2{t)){l-F{t))h{t)dt 
Jo 


= (a+/3)Fr(0)-(a-27)F'(0) 


■00  / d 

-8Fi(0)F3(0)  / [a-—[logeh{t9)] 


0 


dO 


9=1 


[1  — H{t  u)]  h{t)  dt^ 


and 


I,  = r (Mt)  - A,{t))  (1  - ^F{t)'j  h(t)dt 
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= {at/ + (3)  FjiO) 


roo  / d 

- 4Fi(0)F3(0)  ( ai/  - -^[^og  veh{tv6)\ 

- 4F3^(0)  ^ (at/  - -^[iog  i/eh{tue)]  1^^^ 


e=i 


[1  — H{t  u)]  h{t  v)  dt 


[1  — H{t  j/)]  h{t  v)  dt. 


Proof.  Recall  from  (4.2.7),  that  under  Hq, 


F{t)  = 2iFr{t)  + F,it)). 


Therefore, 

roo 

h=  {A^{t)-A2{t)){l-F{t))Mt)dt 
Jo 

= J {Ar(t)  - A2{t))  ft{t)dt  - 2 Fi(0)  J {A^{t)  - A2{t))  Fi(<)  Mt)dt 

-2  F3(0)  j {Ar{t)  - A2{t))  Fsit)  h{t)dt. 


Using  Theorems  4.3.1  and  4.3.2, 


I,  = {a  + /3)F,{0)-{a-2'r)F^{0) 

rco  / d 

-8Fi(0)F3(0)  [oc  - -^[log  eh{te)] 


e=i 


[1  — H{t  u)]  h{t)  dt. 


Derivation  of  the  expression  for  follows  similarly. 

The  expressions  Ii  — Is  in  (4.2.6)  are  evaluated  in  the  next  Section  using  expo- 
nential, folded  standard  normal  and  folded  logistic  densities  for  /i(*).  For  any  other 
desired  density,  the  expressions  can  be  calculated  similarly. 


71 


4.4  Estimation  of  Optimal  Weights 

In  this  section,  expressions  for  given  in  (4.2.6)  are  derived  using  the  re- 

sults of  Theorems  4.3.1-4.3.3  for  three  forms  of  2h{-)  - exponential,  folded  standard 
normal  and  folded  logistic  distribution. 


Exponential  Distribution 


Notice  that  the  exponential  distribution  is  the  folded  double  exponential  distribu- 
tion. By  our  definition, 

2[1  - = e-‘^  for  t > 0, 

2eh{te)  = 


Therefore, 


d 

d6 


\log  6h{t0)] 


e=i 


(4.4.1) 


Using  (4.4.1)  we  can  evaluate  f3,  7 and  other  necessary  constants.  We  get. 


so  that. 


h=  {Ai{t)  - A2{t))  fi{t)  dt 

Jo 

= aFi(O), 

poo 

l2=  (A3(<)  - Aiit))  /3(f)  dt 

Jo 


auF3{0). 


(4.4.2) 
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Now,  7 is  given  by 


7 


= 4 


-^[log  eh{tO)] 


e=i 


[1  - H{t)]  h{t)  dt 


— t)  e dt 


1 


Therefore, 


L 


= /“  - Mt))  (i  - Mt)dt 


= i a F,(0)  + i F,(0). 


(4.4.3) 


Similarly,  to  evaluate  I4  and  Is,  we  note  that 


= 2 / (a  - 1 + 0 dt 

a - I 1 


au  — v9h{t  u9)] 
d9 


0=1 


[1  — H{t)]  h{t  v)  dt 


au  — -jAlog  9h{t  u9)] 
d9 


0=1 


[1  — H{tu)\  uh{tu)  dt 


au  — \ + ut)  e e u dt 


a-l  1 
+ 4- 


2 
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Therefore, 


h = j (A,{t)  - A^it))  {1  - F{t))  h{t)dt 


c.f,(0)  - (a  - i)Fi'(O)  - 2F.(0)F3(0) 


a — 1 1 

+ 


1 + 1/  (1  + ^)‘ 


, (4.4.4) 


and 


Is  — J (243(f)  ~ -^4(f)) 


= avF,m  - (y  - i)  - ‘'fi(O)W) 


au  — 1 V 

+ 


1 + (1  + uY 


.(4.4.5) 


Folded  Standard  Normal  Distribution 


For  folded  standard  normal  distribution,  by  our  definition. 


2[1  - H{td)] 


for  f > 0, 


so  the  corresponding  density  is 


2Bh{te) 


Therefore, 


A 

le 


[log  6h{t9)] 


e=x 


(4.4.6) 


Using  (4.4.6)  we  can  evaluate  /3,  7 and  other  necessary  terms  in  Theorems  4.3. 1-4. 3. 3. 


First  we  have. 


13 


^ L=i 
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so  that, 


= -2 


= 0, 


■?  \ t — f ^ 

1 - t^)  e 


7T 


h = ai^i(O), 

/2  = ai/F3(0) 


(4.4.7) 


Now,  7 is  given  by 


7 = 4/  — 


8=1 


[1  - Hit)]  h{t)dt 


1 12 

-=  e"5“  du 

t 


1 1*2 
e 


- = ‘‘  dt 


- ^f\r^ 

J Jt  \/i 


e"^“'du 


"^  / i 71^ " 


1 _i*2 

e 


\/^ 


dt 


du 


t 1 *2 


V2 


dt 


7T 


= 0.5-0.181616 

= 0.31838 

1 



7T 


Therefore, 

/,  = /^“  (AM  - AM)  (l  - /.(')* 

= ^ a F,(0)  + i F,(0).  (4.4.8) 

Z 7T 

Expressions  for  I4  and  I5  do  not  have  closed  forms  for  folded  standard  normal  distri- 
butions. After  estimating  the  value  of  i/,  the  values  have  to  be  evaluated  numerically. 
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Folded  Logistic  Distribution 


For  folded  logistic  distribution,  by  our  definition, 


2[1 = 

20h{t9)  = 


2e 


-te 


1 + ’ 

20e-^^ 

(1  + 


for  t > 0, 


Therefore, 


de 


[log  eh{t$)] 


2te"‘ 

<?=i  “ ^ ^ (1  + e-‘) 


(4.4.9) 


Using  (4.4.9)  we  can  evaluate  /3,  7 and  other  necessary  terms  in  Theorems  4.3. 1-4. 3. 3. 


First,  (3  is  given  by 


13  = 


Therefore, 


-2  j -^[log  9h{te)] 


= -2 


d9 

J h{t)  dt 

1 + 2 X 0.443147 


^=1 
2te"‘ 


(1  + e 


-t 


- t 


(1  + e~‘)2 


dt 


= -0.113706. 


h = aFi(O)  - 0.113706, 
I2  = ai/F3(0)-  0.113706, 


(4.4.10) 


7 = 4/  -^[log  9h{t9)]  1^^^  [1  - H{t)]  h{t)  dt 

= '/  {'-‘  + 1 


+ e~‘/  (1  + e ‘) 


Now,  7 is. 
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= 0.5  + 4 


= 0.5-4x0.01324 


2t 


(1  + e-‘)3 


dt 


= 0.44704. 


Therefore 

^3  = ^ a Fi(0)  + (0.44704  - 0.113706)  Fi(0) 

- ^ «^i(0)  + ^ Fi(0).  (4.4.11) 

For  folded  logistic  distribution  expressions  and  I5  do  not  have  closed  forms.  After 
estimating  the  value  of  j/,  the  values  of  I4  and  J5  can  be  evaluated  numerically. 


4.5  Forms  of  Estimated  Optimal  Statistics 


In  this  section  the  forms  of  the  optimal  statistics  are  presented  with  estimated 
optimal  weights.  Consistent  estimators  are  used  for  estimating  optimal  weights  in 
the  PR  and  Wl-classes  of  statistics. 

The  expressions  of  estimated  optimal  W^L  Sign  statistic  are  given  here  for  expo- 
nential, folded  standard  normal  and  folded  logistic  distribution. 

Optimal  WL  Sign  Statistic 


The  WL  Sign  statistic  has  the  form, 


T = (1-A) 


1 


•PFW 


d{N,(t)  - N,(t))  + A 


1 


\Z2F3(0) 


= (i-A)r;„  + Ar;„ 


J d(N^{t)  - N^it)) 
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where  A is  an  estimate  of  the  optimal  A.  This  estimate  can  be  obtained  as  follows. 
From  Theorem  3.4.3,  optimal  A is  given  by 


A 


'5ti 


'5U 


+ C 


sc 


where  and  Cgc  are 


Csu  = (efficacy  of  Tgu)  ^ 

v/2Fi(0) 

f (Ai(t)  - A2(t))  fi{t)dt' 


(efficacy  of  Tgc) 


1 

2 


/ {Az{t)  - A4{t))  f3{t)dt 


A consistent  estimate  of  the  optimal  A is  given  by 


A 


su 


+ c;. 


Substituting  the  estimates  of  Ii  and  I2  from  Section  4.4,  we  have  estimates  of  Cju  and 


'5C  J 


'SU 


(a  + /3)Fi(0)  {oc  + /3)y/7^' 


V2 


and 


SC 


^ y/2F3(0)  ^ 

(ar>  + f3)F3(0)  + P)\/Nc+ 


V2 


Therefore,  the  simplified  form  of  the  estimated  optimal  WL  Sign  statistic  when 
{\Zi\  i £ Bj},  j = 1,2, 3, 4 has  exponential  or  folded  standard  normal  distribution 
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is  given  by 


S-OPT  = 


d{Ni{t)  - N2{t))  + i>  / d{N3{t)  - N^it)) 


Similarly,  the  simplified  form  of  the  estimated  optimal  WL  Sign  statistic  when 
{\Zi\  i € Bj}  j = 1,2, 3, 4 has  folded  logistic  distribution,  can  be  shown  to  be  given 

by 


S-OPT  = 


J d{N,(t)  - %(<))  + 


ai>  - 0.113706 


113706 


J diN^it)  - N,{t)) 


The  estimated  optimal  PR  statistics  are  now  given  when  {\Zi\  i G Bj}  j = 1, 2, 3, 4 


has  exponential,  folded  standard  normal  and  folded  logistic  distribution. 


Optimal  PR  Statistic 


The  PR-class  of  statistics  is  represented  as 

T = (1  - j)  / (1  - K(t))  d(N^(^)  - 

vPiw-' 

+ A ^=1=  / d(Ndt)  - N,(t)) 

= (1  - A)  + A 


To  estimate  optimal  value  of  A we  need. 


ciu  = (efficacy  of  Tiu)  = 

V|Fi(0) 

S - A^(t))  {\  - F^(t))  h(t)di' 
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and 


(efficacy  of  Tic) 


1 

2 


\/2f3(0) 

/ {A3{t)  - Aiit))  f3{t)dt' 


Substituting  expressions  of  I3  and  from  Section  4.4, 


we  have  estimates  of  Ci^  and 


and 


\/iA(o)  ^ 

(|a + /3 + 7)  Fi(0)  {\a  + 0 + j)  y/^u+ 

_ yj2F3{0)  _ 

{au  + f3)F3{0)  ~ (««>  + 


Therefore,  the  simplified  form  of  the  estimated  optimal  PR  statistic  when  {\Zi 


i e 


Bj)  for  j = 1,2, 3, 4 has  exponential  distribution,  is  given  by 


PR  - OPT 


I {1  - F^{t))  d{Ni{t)  - N^it)) 

+ 3(^7)  / ■ 


The  simplified  form  of  the  estimated  optimal  PR  statistic  when  {\Zi 
j = 1,2, 3, 4 has  folded  standard  normal  distribution  is  given  by 


i G Bj)  for 


PR  - OPT 


J {1  - F^{t))  d{Ni{t)  - N^it)) 

+ 3.)  / - Nt(t))  . 
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The  simplified  form  of  the  estimated  optimal  PR  statistic  when  {\Zi\ 
j = 1,2, 3, 4 has  folded  logistic  distribution  is  given  by 


i 6 Bj}  for 


PR  - OPT 


j (I  - F,(t))  d{N,(t)  - N,{t)) 

2 3 / 


The  estimated  optimal  WL  Wilcoxon  signed  rank  statistic  when  {\Zi 
for  j = 1,2,  3, 4 has  exponential  distribution  is  given  next. 


i e B,} 


Optimal  WL  Wilcoxon  Signed  Rank  Statistic 


WL  Wilcoxon  Signed  rank  statistic  is  given  by 

T = — i— f (I  - F{t))  d{N,{t)  - N,(t)) 

,J2  S(1  - F{t)r  Mt)  dt  -I 

+ \- ^ , hl-lF(t))d{N,{t)-N,{t)) 

^2/(l-lF(())  Mt)dt 


= (1  — A)  T*„  + A T*^. 

Here  c„,u  and  Cu,c  are  given  by 


= (efficacy  of  Tu,u)  5 


^2  HI  - F(t)Y  Mt)dt 
S{A,(t)  - A2(t))  (1  - F(t))  Mt)dt' 


and 


'WC 


(efficacy  of  T^c) 


1 

2 
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S - \F{t)f  h{t)dt 
f (Asit)  - AM))  (1  - ^F(t))  f3(t)dt' 


In  Section  4.3,  we  have  mentioned  that  U and  h given  in  (4.2.6)  do  not  have  closed 
forms  when  folded  logistic  and  folded  standard  normal  densities  are  used  for  /i(’). 

Using  Theorem  4.3.3  and  Proposition  2.1  in  Dabrowska,  (1990),  the  estimates  of 
c-uiu  and  c,uc  necessary  to  construct  an  estimated  optimal  statistic  for  the  PR-class  are 

A 

given  below,  with  exponential  density  used  for  /i(-).  Recall  that  S{i)  is  Kaplan-Meier 
survival  function  as  described  in  Chapter  2. 

. ^ »(eLi  j(i  - 

aJV„+  - (a  - 0.5)W.\  + 


Cxjuc 


n{Ei=3  /(1-0.55(0)WO 


0.5 


z>aATe+  - (O.SDa  - 0.25)iV2^  ^ 2N^+N,+  [^  - 


The  optimal  WL  Wilcoxon  signed  rank  statistic  when  {\Zi\  i E Bj}  loi  j — 
1,2, 3, 4 has  exponential  distribution  is. 


WL-OPT  = (i-A)r:„  + Ar:„ 


where 


'WU 


^WU  C 


wc 


For  any  other  special  case  of  h{-),  optimal  statistics  of  the  PR  or  WL-class  can  be 
derived  by  following  similar  procedure  as  shown  in  Sections  4.3  and  4.4. 
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Fig  4.1.  Standard  Normal  Density. 
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«|r!) 


z 


Fig  4.2.  Folded  Standard  Normal  Density. 
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z 


Fig  4.3. 


t € U B2)  for  0 = 1.1,  using 


Folded  Standard  Normal  Density  for  h{-). 


CHAPTER  5 


SIMULATION  RESULTS 
5.1  Introduction 

The  expressions  of  the  estimated  optimal  weights  in  the  PR  and  WL-classes  are 
given  in  Chapter  4 for  exponential,  folded  standard  normal  and  folded  logistic  distri- 
butions. Some  of  the  integrals  involved  in  determining  optimal  weights  for  the  WL 
Wilcoxon  signed  rank  statistic  do  not  have  closed  forms.  Therefore,  the  estimated 
optimal  weights  in  these  cases  could  not  be  listed  for  all  the  densities  we  considered. 
In  this  chapter,  we  compare  powers  of  several  relevant  statistics  using  a simulation 
study.  The  results  of  simulation  are  presented  in  Section  5.2. 

5.2  Simulation  Study 

A simulation  study  was  performed  using  the  model  given  in  (4.2.1)  and  (4.2.11). 
Three  functional  forms,  each  with  support  [0,  oo),  were  considered  for  2/i(*)  defined 
at  (4.2.1).  They  were 

the  exponential, 

2uh{t  v)  = ^ 

the  folded  standard  normal  distribution, 

2vh{tv)  = 

\/  llT 


85 


86 


and  the  folded  logistic, 


2uh{t  u) 


= 2 


ue 


— ut 


(1  + e 


-l/t 


y 


V. 


Two  values  u = 0.75  and  v = 1.25  were  considered  for  the  censoring  parameter 
Our  choice  for  the  values  for  i>  was  based  on  the  following  considerations. 


1.  When  1/  < 1.0,  under  ffo,  the  standard  deviation  of  the  distribution  of 
censored  Z is  smaller  than  that  of  an  uncensored  Z.  This  implies  that, 
in  most  singly  censored  pairs,  the  censored  observations  are  not  very 
high  compared  to  the  true  survival  times  of  the  uncensored  observations. 
Thus,  u < 1 implies  that  the  censored  observations  are  not  very  high 
compared  to  the  uncensored  observations  in  singly  censored  pairs.  We 
choose  V = 0.75  to  study  this  case. 

2.  When  1/  > 1.0,  the  standard  deviation  of  the  distribution  of  censored  Z 
is  greater  than  that  of  an  uncensored  Z.  So,  in  many  of  the  singly  cen- 
sored pairs,  censored  observations  are  much  larger  than  the  corresponding 
uncensored  survival  times.  Thus,  u > \ implies  that  the  censored  observa- 
tions are  much  higher  than  the  uncensored  observations  in  singly  censored 
pairs.  We  chose  i/  = 1.25  to  study  this  case. 


For  the  odds  ratio  in  (4.2.11),  we  selected  g[9)  = a(^  — 1),  and  a = 0.6.  Therefore, 

= exp{oc{6  — 1)), 

and 


^3^(0) 

^4^(0) 


= exp{ai/{0  — 1)). 


Power  and  size  of  a 0.05  level  test  based  on  each  statistic  was  estimated  by  gen- 
erating 500  random  samples  of  sizes  n = 100  and  n = 40.  The  alternatives  were 
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specified  by  selected  values  of  9.  Recall  that  for  certain  statistics  (e.g.  PR-OPT)  es- 
timation of  optimal  weights  require  specification  of  a value  for  a.  During  simulation, 
by  specifying  a value  for  a alternative  conditional  distributions  of  the  \Zi\  are  also 
specified.  Therefore,  we  are  interested  in  studying  the  effect  of  using  a wrong  value 
of  a for  estimation  of  optimal  weights.  Powers  of  the  test  statistics  were  estimated 
when  the  optimal  weights  were  calculated  using  a value  of  a different  from  0.6.  Two 
of  the  values  specified  were  higher  than  the  a = 0.6  used  for  generating  samples.  The 
values  were  a = 1 and  a = 1.5.  The  other  two  specified  values  were  smaller  than  0.6, 
a = 0.5  and  a = 0.3.  These  values  for  rate  a were  chosen  arbitrarily. 

Samples  were  generated  such  that  P{i  e Bi  U B2)  = 0.3,  P{i  e B^U  B4)  = 0.6 
and  P(i  G 5b)  = 0.1.  Thus  the  expected  proportion  of  uncensored,  singly  censored 
and  doubly  censored  pairs  were,  respectively,  30%,  60%  and  10%. 

Actual  simulation  of  random  samples  was  performed  in  two  stages.  In  the  first 
stage  a random  sample  of  size  n was  selected  from  an  uniform  [0, 1]  population.  Let 
the  observations  be  denoted  by  z = 1,2, . . . 100.  In  the  second  stage,  the  sample 
of  the  differences  (z)  were  generated.  The  observations  from  the  uniform  distribution 
are  used  to  determine  the  censoring  status  and  the  sign  of  the  differences.  Here  is 
an  example  of  how  it  is  done  under  the  case  of  null  hypothesis  (^  = 1)  where  the 
expected  number  of  positive  differences  is  same  as  the  number  of  negative  differences 
among  both  censored  and  uncensored  differences.  Since  the  probability  is  0.60  that 
an  observation  is  from  an  uncensored  pair,  we  let 


G Bi)  = h{\zi\),  if  0 < u,-  < 0.30, 


ie  Bi)  = h{\z,\  9),  if  0.30  < < 0.60. 
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Similarly  to  generate  censored  differences, 


i G B3)  = h{\zi\  I/), 


if  0.60  < Ui  < 0.75, 


and 


f{\zi\  i G B4)  = h{\zi\  uO)^  if  0.75  < < 0.90. 


From  each  sample,  the  following  statistics  were  calculated. 


1.  PR-EQ  : PR  statistic  with  equal  weights. 


2.  PR-STD  : PR  statistic  with  weights  proportional  to  standard  deviations. 

3.  PR-OPT  : PR  statistic  with  optimal  weights. 

4.  PR-OPT(l),  PR-OPT(2),  PR-OPT(3)  and  PR-OPT(4)  : PR  statistic 
with  optimal  weights  and  a = 1.0,  a = 1.5,  a = 0.30  and  a = 0.10 
respectively. 


5.  S-EQ  : WL  Sign  statistic  with  equal  weights. 

6.  S-STD  : WL  Sign  statistic  with  weights  proportional  to  standard  devia- 
tions. 

7.  S-OPT  : WL  Sign  statistic  with  optimal  weights. 

8.  S-OPT(l),  S-OPT(2),  S-OPT(3)  and  S-OPT(4)  : WL  Sign  statistic  with 
optimal  weights  and  a = 1.0,  a = 1.5,  a = 0.30  and  a = 0.10  respectively. 

9.  W-EQ  : WL  Wilcoxon  signed  rank  statistic  with  equal  weights. 

10.  W-STD  : WL  Wilcoxon  signed  rank  statistic  with  weights  proportional 
to  standard  deviations. 
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Notice  that  in  this  list  of  statistics  considered  for  simulation  study,  optimal  WL 
Wilcoxon  signed  rank  statistic  (W-OPT)  was  not  included.  This  statistic  was  left 
out  on  purpose  because  of  two  reasons.  In  Chapter  3,  we  have  seen  that  when  there 
is  association  (which  is  usually  the  case  with  bivariate  data)  within  the  pairs,  the 
efficacy  of  PR-OPT  is  always  higher  than  the  efficacy  of  W-OPT.  Secondly,  the  form 
of  the  estimated  W-OPT  is  very  complicated  and  does  not  have  a closed  form  in 
most  cases  we  are  considering.  Consequently,  computation  of  WL-OPT  involves  a 
complicated  procedure  which  requires  enormous  computer  time.  Considering  all  these 
factors  we  decided  not  to  include  W-OPT  in  the  simulation  study. 

For  estimating  power  of  the  optimal  statistics,  the  following  test  procedure  was 
used.  If  any  of  the  estimated  odds  ratios  are  equal  to  1,  then  the  statistic  with  weights 
proportional  to  standard  deviations  is  used  instead  of  the  optimal  statistic.  This 
procedure  was  used  because  when  any  of  the  estimated  odds  ratios  are  1,  estimation 
of  optimal  weights  is  not  possible. 

First  let  us  look  at  the  powers  of  tests  when  sample  size  is  large  (n  = 100).  These 
cases  are  presented  in  the  Tables  5. 1-5.6.  Notice  that  in  each  table,  estimated  level  of 
test  is  within  0.05  — and  0.05  + 2 \ within  (0.0305,  0.0695). 

In  Table  5.1,  (\Zi\  z G 5j),  j = 1,2,  3, 4,  have  exponential  distributions,  and 
1/  = 0.75.  Among  the  PR-class  of  statistics,  PR-EQ  performed  better  than  PR-STD. 
But,  there  was  almost  30%  (power  of  PR-EQ=0.302  vs  power  of  PR-OPT=:0.410) 
improvement  of  power  near  the  null  hypothesis  when  PR-OPT  was  used  instead  of 
PR-EQ.  As  we  move  further  away  from  the  null  hypothesis,  the  improvement  in 
power  resulting  from  the  use  of  the  optimal  statistic  in  any  class  is  not  so  much. 
This  is  as  expected  because  any  reasonable  statistic  should  have  high  power  when 
the  alternative  hypothesis  is  very  far  away  from  the  null  hypothesis.  Among  Sign 
statistics,  S-STD  performed  better  than  S-EQ.  Near  the  null  hypothesis,  there  was 
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40%  improvement  of  power  by  using  S-OPT.  Although  powers  of  W-EQ  and  W-STD 
were  higher  than  that  of  S-OPT,  in  some  cases,  they  were  always  less  than  that  of 
PR-OPT. 

As  we  have  mentioned  earlier,  we  are  interested  to  see  how  powers  of  optimal 
statistics  are  affected  by  wrong  specification  of  a.  From  each  table,  it  is  clear  that, 
for  S-OPT  powers  are  not  at  all  affected  by  specifying  values  of  a that  are  different 
from  0.6.  Recall  that  a = 0.6  was  used  in  specifying  the  alternative  hypothesis.  For 
PR-OPT,  when  the  value  of  a specified  for  calculation  of  optimal  weights  is  lower 
than  the  true  a,  in  general  there  is  some  loss  of  power. 

In  Table  5.2  {\Zi\  i £ Bj)^  j = 1,2, 3, 4,  have  exponential  distributions  and 
u = 1.25.  The  powers  of  W-EQ  and  W-STD  were  sometimes  higher  than  that  for 
PR-OPT  and  S-OPT.  Although  near  the  null  hypothesis  there  is  almost  18%  (power 
of  PR-EQ=0.372  vs  power  of  PR-OPT=0.438)  increase  in  power  resulting  from  the 
use  of  the  PR-OPT  than  any  other  statistics  in  the  PR-class,  the  powers  of  W-EQ  and 
W-STD  are  not  very  low  compared  to  PR-OPT  at  alternatives  substantially  different 
from  the  Hq.  In  this  case,  either  PR-OPT,  W-EQ  or  W-STD  should  be  used. 

In  Table  5.3,  the  case  of  (|Zi|  i £ Bj)^  j = 1,2, 3, 4,  with  folded  standard 
normal  distributions  and  u = 0.75  is  presented.  Here,  members  of  the  PR-class  are 
performing  very  well.  Powers  of  W-EQ  and  W-STD  both  were  low  compared  to  the 
powers  of  the  rest  of  the  statistics  in  our  list.  In  this  case,  the  power  of  PR-STD  is 
higher  than  that  of  PR-EQ  and  near  the  improvement  in  power  resulting  from 
the  use  of  PR-OPT  is  almost  30%.  Here,  the  choice  of  statistic  to  be  used  is  PR-OPT. 

In  Table  5.4,  where  {\Zi\  i £ Bj)^  j = 1,2,  3, 4,  have  folded  standard  normal 
distributions  and  u = 1.25.  The  performances  of  W-EQ  and  W-STD  were  poor 
compared  to  the  performance  of  other  statistics  in  the  list.  It  appears  from  Tables 
5. 3-5. 4,  that  when  the  conditional  distribution  of  absolute  differences  IZA  have  folded 
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standard  normal  distributions,  the  powers  of  both  W-EQ  and  W-STD  will  be  low 
and  the  choice  of  statistic  to  be  used  is  PR-OPT. 

In  Table  5.5,  when  {\Z{\  i e Bj),  j = 1,2, 3, 4,  have  folded  logistic  distributions 
and  u = 0.75,  members  of  the  PR-class  have  powers  higher  than  the  statistics  in 
WL-class.  In  this  situation  the  recommended  statistic  is  PR-OPT.  There  is  almost 
40%  increase  in  power  near  the  null  hypothesis  resulting  from  the  use  of  the  optimal 
statistic  in  the  PR-class  instead  of  any  other  member  from  the  same  class.  Although 
the  improvement  in  power  resulting  from  using  S-OPT  instead  of  S-EQ  or  S-STD  is 
more  than  40%,  power  of  S-OPT  is  still  much  lower  than  the  power  of  PR-OPT. 

In  Table  5.5,  when  {\Zi\  i € Bj),  j = 1,2,  3, 4,  have  folded  logistic  distributions 
and  V = 1.25,  the  power  of  PR-OPT  is  higher  than  the  power  of  the  other  statistics. 
But,  in  this  case,  W-EQ  and  W-STD  have  higher  powers  than  S-EQ  and  S-STD. 

Now  we  consider  the  case  of  moderate  (n  = 40)  sample  sizes.  Here  also,  in  each 


table,  estimated  level  of  test  is  within  0.05  — and  0.05  -|-  he. 

within  (0.0305,  0.0695). 

For  {\Zi\  i G Bj),  j — 1,2,3, 4,  having  exponential  distributions,  when  v = 0.75, 
PR-OPT  is  performing  well  overall.  Also,  both  W-EQ  and  W-STD  are  performing 
better  than  the  Sign  statistics,  even  better  than  the  optimal  Sign  test  as  we  move 
away  from  the  null  hypothesis.  From  Table  5.8,  when  v = 1.25,  PR-OPT  has  high 
power  near  i7o,  but  W-EQ  and  W-STD  also  are  performing  very  well.  Also,  it  is  clear 
that  the  power  of  PR-OPT  is  not  very  high  compared  to  W-EQ  when  9 > 1.33.  In 
this  case  the  choice  is  between  PR-OPT  and  W-EQ. 

Like  the  case  of  n = 100,,  when  {\Zi\  i G Bj),  j = 1,2, 3, 4,  have  folded  standard 
normal  distributions,  W-EQ  and  W-STD  are  performing  poorly  compared  to  the 
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other  statistics.  From  Tables  5.9  and  5.10,  the  statistic  which  performs  best  is  PR- 
OPT.  The  same  is  true  for  both  the  values  of  u when  conditional  distribution  of  \Z{ 
has  folded  logistic  density. 

Therefore,  it  appears  that  in  most  of  the  cases  we  have  considered,  PR-OPT 
emerges  as  the  best  statistic.  Also,  recall  that  for  any  member  of  the  PR-class,  an 
exact  test  can  be  performed  for  small  samples.  Only,  when  {\Zi\  i G 5^),  j = 1,2, 3, 4, 
have  exponential  distribution,  powers  of  W-EQ  and  W-STD  are  very  close  to  that  of 


PR-OPT. 
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Table  5.1.  Power  of  the  tests  for  Exponential  distribution  with  a = 0.6, 

u = 0.75.  and  sample  size  = 100. 


Tests 

II 

• 

o 

o 

e = 1.33 

e = 1.66 

e = 2.00 

PR-EQ 

0.040 

0.302 

0.656 

0.900 

PR-STD 

0.040 

0.286 

0.628 

0.888 

PR-OPT 

0.040 

0.410 

0.740 

0.964 

PR-OPT(l) 

0.040 

0.410 

0.742 

0.964 

PR-OPT(2) 

0.040 

0.414 

0.736 

0.964 

PR-OPT(3) 

0.040 

0.408 

0.740 

0.964 

PR-OPT(4) 

0.040 

0.374 

0.732 

0.960 

S-EQ 

0.042 

0.244 

0.516 

0.800 

S-STD 

0.052 

0.260 

0.538 

0.828 

S-OPT 

0.052 

0.380 

0.634 

0.898 

S-OPT(l) 

0.052 

0.380 

0.634 

0.898 

S-OPT(2) 

0.052 

0.380 

0.634 

0.898 

S-OPT(3) 

0.052 

0.380 

0.634 

0.898 

S-OPT(4) 

0.052 

0.380 

0.634 

0.898 

W-EQ 

0.056 

0.322 

0.734 

0.942 

W-STD 

0.044 

0.316 

0.702 

0.924 
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Table  5.2.  Power  of  the  tests  for  Exponential  distribution  with  a = 0.6, 

v = 1.25.  and  sample  size  = 100. 


Tests 

0 = 1.00 

e = 1.33 

e = 1.66 

0 ---  2.00 

PR-EQ 

0.040 

0.372 

0.794 

0.972 

PR-STD 

0.040 

0.362 

0.782 

0.962 

PR-OPT 

0.040 

0.438 

0.830 

0.976 

PR-OPT(l) 

0.040 

0.440 

0.828 

0.976 

PR-OPT(2) 

0.040 

0.446 

0.826 

0.976 

PR-OPT(3) 

0.040 

0.436 

0.826 

0.976 

PR-OPT(4) 

0.040 

0.388 

0.784 

0.974 

S-EQ 

0.042 

0.294 

0.650 

0.914 

S-STD 

0.052 

0.300 

0.642 

0.920 

S-OPT 

0.052 

0.406 

0.726 

0.942 

S-OPT(l) 

0.052 

0.406 

0.726 

0.942 

S-OPT(2) 

0.052 

0.406 

0.726 

0.942 

S-OPT(3) 

0.052 

0.406 

0.726 

0.942 

S-OPT(4) 

0.052 

0.406 

0.726 

0.942 

W-EQ 

0.056 

0.410 

0.844 

0.982 

W-STD 

0.042 

0.402 

0.848 

0.982 
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Table  5.3.  Power  of  the  tests  for  Folded  Standard  Normal  distribution  with  a = 0.6, 

u = 0.75.  and  sample  size  = 100. 


Tests 

0 = 1.00 

d = 1.33 

0 = 1.66 

9 = 2.00 

PR-EQ 

0.038 

0.356 

0.732 

0.942 

PR-STD 

0.038 

0.316 

0.694 

0.920 

PR-OPT 

0.038 

0.458 

0.832 

0.978 

PR-OPT(l) 

0.038 

0.460 

0.818 

0.978 

PR-OPT(2) 

0.038 

0.464 

0.814 

0.978 

PR-OPT(3) 

0.038 

0.452 

0.834 

0.978 

PR-OPT(4) 

0.038 

0.418 

0.828 

0.978 

S-EQ 

0.042 

0.244 

0.516 

0.800 

S-STD 

0.052 

0.260 

0.538 

0.828 

S-OPT 

0.052 

0.380 

0.634 

0.898 

S-OPT(l) 

0.052 

0.380 

0.634 

0.898 

S-OPT(2) 

0.052 

0.380 

0.634 

0.898 

S-OPT(3) 

0.052 

0.380 

0.634 

0.898 

S-OPT(4) 

0.052 

0.380 

0.634 

0.898 

W-EQ 

0.044 

0.202 

0.492 

0.754 

W-STD 

0.050 

0.202 

0.474 

0.740 
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Table  5.4.  Power  of  the  tests  for  Folded  Standard  Normal  distribution  with  a = 0.6, 

V = 1.25.  and  sample  size  = 100. 


Tests 

II 

• 

O 

o 

6 = 1.33 

d = 1.66 

e = 2.00 

PR-EQ 

0.038 

0.420 

0.838 

0.990 

PR-STD 

0.038 

0.404 

0.826 

0.982 

PR-OPT 

0.038 

0.480 

0.874 

0.992 

PR-OPT(l) 

0.038 

0.478 

0.868 

0.992 

PR-OPT(2) 

0.038 

0.480 

0.862 

0.992 

PR-OPT(3) 

0.038 

0.484 

0.876 

0.992 

PR-OPT(4) 

0.038 

0.436 

0.858 

0.992 

S-EQ 

0.042 

0.294 

0.650 

0.914 

S-STD 

0.052 

0.300 

0.642 

0.920 

S-OPT 

0.052 

0.406 

0.726 

0.942 

S-OPT(l) 

0.052 

0.406 

0.726 

0.942 

S-OPT(2) 

0.052 

0.406 

0.726 

0.942 

S-OPT(3) 

0.052 

0.406 

0.726 

0.942 

S-OPT(4) 

0.052 

0.406 

0.726 

0.942 

W-EQ 

0.054 

0.274 

0.622 

0.884 

W-STD 

0.058 

0.266 

0.626 

0.888 
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Table  5.5.  Power  of  the  tests  for  Folded  Logistic  distribution  with  a = 0.6, 

1/  = 0.75.  and  sample  size  = 100. 


Tests 

II 

• 

o 

o 

CO 

CO 

• 

1 — 1 

II 

e = 1.66 

e = 2.00 

PR-EQ 

0.062 

0.360 

0.680 

0.950 

PR-STD 

0.062 

0.334 

0.666 

0.922 

PR-OPT 

0.062 

0.478 

0.798 

0.976 

PR-OPT(l) 

0.062 

0.474 

0.794 

0.976 

PR-OPT(2) 

0.062 

0.468 

0.790 

0.974 

PR-OPT(3) 

0.062 

0.452 

0.794 

0.972 

PR-OPT(4) 

0.062 

0.336 

0.708 

0.940 

S-EQ 

0.042 

0.244 

0.516 

0.798 

S-STD 

0.052 

0.260 

0.538 

0.828 

S-OPT 

0.052 

0.380 

0.632 

0.898 

S-OPT(l) 

0.052 

0.380 

0.632 

0.898 

S-OPT(2) 

0.052 

0.380 

0.632 

0.898 

S-OPT(3) 

0.052 

0.380 

0.632 

0.898 

S-OPT(4) 

0.052 

0.380 

0.632 

0.898 

W-EQ 

0.056 

0.210 

0.478 

0.750 

W-STD 

0.054 

0.202 

0.476 

0.746 
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Table  5.6.  Power  of  the  tests  for  Folded  Logistic  distribution  with  a = 0.6, 

1/  = 1.25.  and  sample  size  = 100. 


Tests 

9 = 1.00 

9 = 1.33 

9 = 1.66 

9 = 2.00 

PR-EQ 

0.062 

0.366 

0.706 

0.956 

PR-STD 

0.062 

0.344 

0.690 

0.932 

PR-OPT 

0.062 

0.480 

0.814 

0.976 

PR-OPT(l) 

0.062 

0.474 

0.810 

0.976 

PR-OPT(2) 

0.062 

0.470 

0.806 

0.974 

PR-OPT(3) 

0.062 

0.452 

0.814 

0.974 

PR-OPT(4) 

0.062 

0.334 

0.712 

0.934 

S-EQ 

0.042 

0.256 

0.546 

0.818 

S-STD 

0.052 

0.264 

0.558 

0.844 

S-OPT 

0.052 

0.384 

0.662 

0.904 

S-OPT(l) 

0.052 

0.384 

0.662 

0.904 

S-OPT(2) 

0.052 

0.384 

0.662 

0.904 

S-OPT(3) 

0.052 

0.384 

0.662 

0.904 

S-OPT(4) 

0.052 

0.384 

0.662 

0.904 

W-EQ 

0.050 

0.268 

0.624 

0.874 

W-STD 

0.052 

0.272 

0.632 

0.876 
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Table  5.7.  Power  of  the  tests  for  Exponential  distribution  with  a = 0.6, 

V = 0.75.  and  sample  size  = 40. 


Tests 

II 

• 

o 

o 

e = 1.33 

e = 1.66 

e = 2.00 

e = 2.33 

PR-EQ 

0.040 

0.152 

0.364 

0.614 

0.784 

PR-STD 

0.042 

0.136 

0.354 

0.580 

0.762 

PR-OPT 

0.042 

0.250 

0.512 

0.706 

0.864 

PR-OPT(l) 

0.042 

0.252 

0.512 

0.708 

0.858 

PR-OPT(2) 

0.042 

0.252 

0.512 

0.712 

0.860 

PR-OPT(3) 

0.042 

0.240 

0.502 

0.704 

0.848 

PR-OPT(4) 

0.042 

0.196 

0.460 

0.680 

0.834 

S-EQ 

0.050 

0.138 

0.302 

0.508 

0.676 

S-STD 

0.050 

0.138 

0.308 

0.540 

0.714 

S-OPT 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(l) 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(2) 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(3) 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(4) 

0.050 

0.214 

0.448 

0.636 

0.776 

W-EO 

V 

0.040 

0.170 

0.392 

0.638 

0.818 

W-STD 

0.040 

0.156 

0.378 

0.614 

0.788 
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Table  5.8.  Power  of  the  tests  for  Exponential  distribution  with  a = 0.6, 

V = 1.25.  and  sample  size  = 40. 


Tests 

e = 1.00 

e = 1.33 

e = 1.66 

e = 2.00 

6 = 2.33 

PR-EQ 

0.040 

0.188 

0.462 

0.730 

0.896 

PR-STD 

0.042 

0.182 

0.446 

0.712 

0.898 

PR-OPT 

0.042 

0.266 

0.556 

0.762 

0.892 

PR-OPT(l) 

0.042 

0.268 

0.556 

0.764 

0.894 

PR-OPT(2) 

0.042 

0.270 

0.554 

0.766 

0.894 

PR-OPT(3) 

0.042 

0.252 

0.550 

0.760 

0.890 

PR-OPT(4) 

0.042 

0.202 

0.506 

0.724 

0.870 

S-EQ 

0.050 

0.162 

0.380 

0.628 

0.828 

S-STD 

0.050 

0.152 

0.378 

0.640 

0.828 

S-OPT 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(l) 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(2) 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(3) 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(4) 

0.050 

0.228 

0.478 

0.684 

0.842 

W-EQ 

0.042 

0.214 

0.480 

0.752 

0.918 

W-STD 

0.040 

0.202 

0.472 

0.746 

0.918 
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Table  5.9.  Power  of  the  tests  for  Folded  Standard  Normal  distribution  with  a = 0.6, 

1/  = 0.75.  and  sample  size  = 40. 


Tests 

e = 1.00 

9 = 1.33 

9 = 1.66 

9 = 2.00 

9 = 2.33 

PR-EQ 

0.042 

0.158 

0.394 

0.676 

0.838 

PR-STD 

0.040 

0.150 

0.384 

0.626 

0.788 

PR-OPT 

0.040 

0.244 

0.540 

0.784 

0.912 

PR-OPT(l) 

0.040 

0.250 

0.542 

0.784 

0.912 

PR-OPT(2) 

0.040 

0.246 

0.542 

0.780 

0.910 

PR-OPT(3) 

0.040 

0.234 

0.524 

0.782 

0.910 

PR-OPT(4) 

0.040 

0.214 

0.492 

0.760 

0.890 

S-EQ 

0.050 

0.138 

0.302 

0.508 

0.676 

S-STD 

0.050 

0.138 

0.308 

0.540 

0.714 

S-OPT 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(l) 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(2) 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(3) 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(4) 

0.050 

0.214 

0.448 

0.636 

0.776 

W-EQ 

0.056 

0.136 

0.254 

0.456 

0.628 

W-STD 

0.050 

0.124 

0.246 

0.444 

0.616 
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Table  5.10.  Power  of  the  tests  for  Folded  Standard  Normal  distribution  with  a = 0.6, 

u = 1.25.  and  sample  size  = 40. 


Tests 

II 

• 

O 

o 

e = 1.33 

e = 1.66 

9 = 2.00 

9 = 2.33 

PR-EQ 

0.042 

0.206 

0.516 

0.770 

0.936 

PR-STD 

0.040 

0.206 

0.490 

0.750 

0.928 

PR-OPT 

0.040 

0.270 

0.588 

0.816 

0.916 

PR-OPT(l) 

0.040 

0.278 

0.592 

0.814 

0.916 

PR-OPT(2) 

0.040 

0.274 

0.594 

0.812 

0.914 

PR-OPT(3) 

0.040 

0.258 

0.578 

0.810 

0.914 

PR-OPT(4) 

0.040 

0.222 

0.542 

0.786 

0.900 

S-EQ 

0.050 

0.162 

0.380 

0.628 

0.828 

S-STD 

0.050 

0.152 

0.378 

0.640 

0.828 

S-OPT 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(l) 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(2) 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(3) 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(4) 

0.050 

0.228 

0.478 

0.684 

0.842 

W-EQ 

0.052 

0.152 

0.338 

0.588 

0.772 

W-STD 

0.046 

0.150 

0.344 

0.582 

0.786 
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Table  5.11.  Power  of  the  tests  for  Folded  Logistic  distribution  with  a = 0.6, 

u = 0.75.  and  sample  size  = 40. 


Tests 

e = 1.00 

e = 1.33 

e = 1.66 

e = 2.00 

e = 2.33 

PR-EQ 

0.060 

0.178 

0.400 

0.650 

0.808 

PR-STD 

0.060 

0.168 

0.374 

0.620 

0.780 

PR-OPT 

0.060 

0.272 

0.538 

0.762 

0.888 

PR-OPT(l) 

0.060 

0.272 

0.540 

0.764 

0.890 

PR-OPT(2) 

0.060 

0.272 

0.540 

0.760 

0.886 

PR-OPT(3) 

0.060 

0.244 

0.524 

0.752 

0.886 

PR-OPT(4) 

0.060 

0.166 

0.412 

0.642 

0.802 

S-EQ 

0.050 

0.138 

0.302 

0.508 

0.676 

S-STD 

0.050 

0.138 

0.308 

0.540 

0.714 

S-OPT 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(l) 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(2) 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(3) 

0.050 

0.214 

0.448 

0.636 

0.776 

S-OPT(4) 

0.050 

0.214 

0.448 

0.636 

0.776 

W-EQ 

0.052 

0.144 

0.268 

0.458 

0.634 

W-STD 

0.060 

0.140 

0.250 

0.438 

0.618 

Table  5.12.  Power  of  the  tests  for  Folded  Logistic  distribution  with  a = 0.6, 

V = 1.25.  and  sample  size  = 40. 


Tests 

9 = 1.00 

II 

GO 

00 

9 = 1.66 

9 = 2.00 

9 = 2.33 

PR-EQ 

0.060 

0.216 

0.486 

0.774 

0.922 

PR-STD 

0.060 

0.218 

0.464 

0.756 

0.906 

PR-OPT 

0.060 

0.292 

0.576 

0.804 

0.898 

PR-OPT(l) 

0.060 

0.298 

0.580 

0.810 

0.902 

PR-OPT(2) 

0.060 

0.300 

0.582 

0.804 

0.902 

PR-OPT(3) 

0.060 

0.262 

0.562 

0.796 

0.892 

PR-OPT(4) 

0.060 

0.174 

0.426 

0.652 

0.774 

S-EQ 

0.050 

0.162 

0.380 

0.628 

0.828 

S-STD 

0.050 

0.152 

0.378 

0.640 

0.828 

S-OPT 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(l) 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(2) 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(3) 

0.050 

0.228 

0.478 

0.684 

0.842 

S-OPT(4) 

0.050 

0.228 

0.478 

0.684 

0.842 

W-EQ 

0.052 

0.170 

0.340 

0.600 

0.786 

W-STD 

0.062 

0.166 

0.334 

0.574 

0.790 
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5.3  A Worked  Example 

We  consider  the  leukemia  remission  data  of  Freirch  et  al.  (1963)  to  analyze  the 
difference  between  the  remission  times  of  patients  in  control  and  treatment  groups. 
Leukemia  patients  sharing  similar  characteristics  were  paired  together  and  each  mem- 
ber of  the  pair  was  assigned  at  random  to  either  treatment  or  control  group.  The 
following  table  gives  remission  times  in  weeks. 


Table  5.1.  Paired  remission  times  for  Leukemia  patients.  Censored  observations  are 

indicated  by  asterisks. 


Obs. 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

Control 

1 

22 

3 

2 

8 

17 

2 

11 

8 

12 

2 

5 

4 

Treatment 

10 

7 

22* 

23 

22 

6 

16 

34* 

32* 

25* 

11* 

20* 

19* 

Obs. 

14 

15 

16 

17 

18 

19 

20 

21 

Control 

15 

8 

23 

5 

11 

4 

1 

8 

Treatment 

6 

17* 

35* 

6 

13 

9* 

6* 

10* 

As  can  be 

seen 

from  the  data, 

3,  iV„. 

- = 6, 

Nc+ 

= 12,  and  Nc- 

= 0.  A 

consistent  estimator  of  u is  given  by 

logNc^  - logNc- 

u = . 

logNs^  — logNs- 

Since  Nc-  = 0,  logNc-  is  undefined.  In  order  to  estimate  z/,  we  assume  that 
logNc-  = 0 and  we  get  u = 3.585. 
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The  normal  q-q  plots  for  {\zi\  Bj}^  j = 1, 2,  3,  indicate  that  normal  distributions 
fit  the  differences  well  within  each  class.  Of  course,  the  small  number  of  observations 
in  each  class  makes  the  reliability  of  q-q  analysis  somewhat  questionable. 

After  some  elementary  calculations,  we  have 


S - EQ  = 3.156,  S - STD  = 3.273,  S - OPT  = 3.602. 


The  corresponding  one  sided  p-values  are  0.0007,  0.0005  and  0.00015  respectively. 
From  the  PR-class, 


PR-  EQ  = 2.952  and  PR  - STD  = 3.384, 


and  the  corresponding  one  sided  p-values  are  0.0015  and  0.00035. 
Also,  we  have 


W -EQ  = 2.897  and  W - STD  = 3.459 

with  one  sided  p-values  0.00188  and  0.00027  respectively.  All  these  statistics  indicate 
that  the  null  hypothesis  should  be  rejected. 

Now,  to  calculate  the  PR-OPT,  we  have  to  specify  the  slope  a.  Specifying  the 
values  of  slope  as  0.6,  0.8,  1.0,  and  1.5,  the  estimated  optimal  PR  statistics  are  given 
by  3.439,  3.476,  3.495  and  3.515.  The  one  sided  p-values  corresponding  to  these 
statistics  are  0.00023,  0.00029,  0.00025,  and  0.00022  respectively. 

Therefore,  by  using  any  of  the  test  procedures  we  reject  the  null  hypothesis  of 
symmetry  and  conclude  the  remission  times  of  patients  in  treatment  group  is  longer 
than  the  remission  times  of  patients  in  control  group. 
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5.4  Conclusions 


The  simulation  study  of  Section  5.2  compared  powers  of  several  statistics  for  large 
(n  = 100)  and  moderate  (n  = 40)  sample  sizes.  In  most  of  the  cases,  we  considered, 
optimal  statistic  in  the  PR-class  emerged  as  the  best  statistic  in  terms  of  power  under 
the  alternatives.  Only,  when  j = 1,2,3,  have  exponential  distributions, 

W-EQ  or  W-STD  could  be  used  instead  of  PR-OPT. 

Regarding  the  optimal  statistics,  we  have  seen  that  the  power  of  PR-OPT  is 
slightly  low  when  the  value  of  a specified  in  the  calculation  of  optimal  weight  is  lower 
than  the  true  value  of  a which  characterizes  the  distribution.  Therefore,  in  selecting 
a value  for  a,  it  is  recommended  that  one  chooses  a “high”  value  for  a. 


REFERENCES 

Dabrowska.  D.  M.  (1990).  Signed- Rank  Tests  for  Censored  Matched  Pairs. 

Journal  of  the  American  Statistical  Association  85,  478-485. 

Freireich,  E.  J.,  et  al.  (1963).  The  Effect  of  6-Mercaptopourine  on  the  Duration 
of  Steroid-Induced  Remission  of  Acute  Leukemia  : A Model  for  Evaluation  of 
Other  Potentially  Useful  Therapy.  Blood  21,  699-716. 

Hajek,  J.,  & Sidak,  Z.  (1967).  The  Theory  of  Rank  Tests.  New  York:Academic  Press. 

Kalbfleisch,  J.  D.,  & Prentice,  R.  L.  (1980).  The  Statistical  Analysis  of 
Failure  Time  Data.  New  York:  John  Wiley  & Sons. 

Kaplan,  E.  L.,  & Meier,  P.  (1958).  Nonparametric  Estimation  From  Incomplete  Ob- 
servations. Journal  of  the  American  Statistical  Association  53,  457-481. 

Knuth,  D.  E.  (1984).  The  TgpCbook.  Reading,  Massachusetts:  Addison- Wesley. 

Lamport,  L.  (1986).  I^TgK:  A Document  Preparation  System. 

Reading,  Massachusetts:  Addison- Wesley. 

Leavitt,  S.  S.  & Olshen,  R.  A.  (1974).  The  Insurance  Claims  Adjuster  as 
Patient’s  Advocate:  Quantitative  Impact.  Report  for  Insurance 
Technology  Company,  Berkeley,  California. 

Miller,  R.  G.  (1981).  Survival  Analysis.  New  York:  Wiley. 

Popovich,  E.  A.  (1983).  Nonparametric  Analysis  of  Bivariate  Censored  Data. 

Ph.D.  dissertation.  University  of  Florida. 

Popovich,  E.  A.,  & Rao,  P.  V.  (1985).  Conditional  Tests  for  Censored  Matched  Pairs. 
Communications  in  Statistics  14(9),  2041-2056. 

Randles,  R.  H.,  & Wolfe,  D.  A.  (1979).  Introduction  to  the  Theory  of  Nonparametric 
Statistics.  New  York:  Wiley. 

Serfling,  R.  J.  (1980).  Approximation  Theorems  of  Mathematical  Statistics. 

New  York:  Wiley. 


108 


Wolfram,  Stephen  B.  (1988).  Mathematica.  Redwood  City, 

California:  Addison- Wesley. 

Woolson,  R.  F.,  & Lachenbruch,  P.  A.  (1980).  Rank  Tests  for  Censored 
Matched  Pairs.  Biometrika  67,  597-606. 


109 


BIOGRAPHICAL  SKETCH 


Aparna  RayChaudhuri  was  born  on  February  29,  1964,  in  Calcutta,  India.  After 
finishing  high  school  in  1982  she  went  to  Presidency  College,  Calcutta.  In  1986  she 
received  Bachelor  of  Science  from  University  of  Calcutta  with  honors  in  statistics  and 
received  P.C.  Mahalonobis  Award  for  graduating  at  the  top  of  the  class.  She  came 
to  graduate  school  at  the  University  of  Florida  in  Fall  1986  and  received  the  Master 
of  Statistics  in  1988.  She  expects  to  receive  the  degree  Doctor  of  Philosophy  at  the 
end  of  summer  1992.  She  is  a member  of  the  Institute  of  Mathematical  Statistics. 

While  in  graduate  school  Aparna  worked  as  a Teaching  Assistant  for  three  years. 
For  the  last  three  years,  she  has  been  working  as  a consultant  in  the  consulting  center 
of  the  Department  of  Statistics  within  the  Institute  of  Food  and  Agriculture  Sciences. 
Upon  graduation,  she  will  be  an  assistant  professor  in  the  Division  of  Mathematics, 
Statistics  and  Computer  Science  of  the  University  of  Texas  at  San  Antonio. 


no 


I certify  that  I have  read  this  study  and  that  in  my  opinion  it  conforms  to  accept- 
able standards  of  scholarly  presentation  and  is  fully  adequate,  in  scope  and  quality, 
as  a dissertation  for  the  degree  of  Doctor  of  Philosophy. 


Pejaver  V.  Rao,  Chairman 
Professor  of  Statistics 


I certify  that  I have  read  this  study  and  that  in  my  opinion  it  conforms  to  accept- 
able standards  of  scholarly  presentation  and  is  fully  adequate,  in  scope  and  quality, 
as  a dissertation  for  the  degree  of  Doctor  of  Philosophy. 


Ronald  H.  Randles^ 
Professor  of  Statistics 


I certify  that  I have  read  this  study  and  that  in  my  opinion  it  conforms  to  accept- 
able standards  of  scholarly  presentation  and  is  fully  adequate,  in  scope  and  quality, 
as  a dissertation  for  the  degree  of  Doctor  of  Philosophy. 


Malay  Ghosh  M 

o 

Professor  of  Statistics 


I certify  that  I have  read  this  study  and  that  in  my  opinion  it  conforms  to  accept- 
able standards  of  scholarly  presentation  and  is  fully  adequate,  in  scope  and  quality, 
as  a dissertation  for  the  degree  of  Doctor  of  Philosophy. 


Murali  Rao 

Professor  of  Mathematics 


This  dissertation  was  submitted  to  the  Graduate  Faculty  of  the  Department  of 
Statistics  in  the  College  of  Liberal  Arts  and  Sciences  and  to  the  Graduate  School 
and  was  accepted  as  partial  fulfillment  of  the  requirements  for  the  degree  of  Doctor 
of  Philosophy. 


August  1992 


Dean,  Graduate  School 


